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When it comes to building, deploying, and managing 
your database solutions, SQL Server 2000 is designed 
to be fast. How fast, you ask? Way fast. 


We’re talking turbo-charged, rocket-powered, 
pedal-to-the-metal fast. It’s quick it jams 
it zooms it hauls. 


fegisisied ademarks of trademarks of Microsoft Corporation in the United States and/or other countries. 
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Introducing Microsoft SQL Server 2000. Okay, that headline took awhile. 


But that’s only because there are so many build-faster deploy-faster 


manage-faster improvements in SQL Server 2000. Improvements like rich XML 


Mi support that lets platforms 
SOL work together. And Web- 
i Edition enabled Analysis Services 


such as data mining and OLAP so you can analyze trends and make 


predictions faster than ever before. Find out more at microsoft.com/sql 
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OneSource™ simultaneous round-trip engineering! Class diagrams 
and source code, always in-sync, always up-to-date, giving you 
complete freedom to work visually or textually (Java, C++ or IDL) as 
you see fit. Plus robust programming editor with syntax highlighting 
and auto-completion features; compiler with messages auto-linked 
to diagram element, source-code line, and property editor; and 
distributed, multi-threaded debugger. 








EJB features: CF EntiyBear : 

¢ Generate, configure, ee an 

reconfigure, saccoutKeyine primary Key 

customize, and | sunsetentiyContexto-vowd 

compile eomenam 

. . ae ne ~ Methods s 

e Visually edit your parebieleiariiag : 

. +ejbStored aid : 

create, finder and jbLoadd)woid 

business methods in —_|_»ttivcontextjavaxeit.Entivconter pects 

any EJB | * | ~ Create methods | 

= +ehCreated veld : 

° eat ti updates | cpposicreston vo 

ira 5 shel ~ Floder methods 

interfaces and | + findiyPriman/Key(ok:AccountPR) Account 

primary-key classes eee » Business methods _ 

j | +depostawaid E 
(hidden from view, |. 
unless you choose sbalanced.void 


. 


e Round trip EJB-JDBC, ERD-JDBC, and JSP-JavaBean support 
e Generate, configure, reconfigure and customize GoF patterns 


e Add your own company-specific patterns (express simple patterns 
in text, complex in Java) 
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private int usegekuies; 
private bigdecimal costPertnitotiag; 





BigPlay™ team support is just a mouse-click away! Complete end-to- 
end multi-user version control for models, source and documentation 
across the enterprise (using CVS or whatever SCC-compliant version 
control system you prefer). 
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UML 1.3 Nine kinds of diagrams! Reverse engineer any source code 
into sequence diagrams, ERDs, too. Plus Raftmaker™ customizable 
multi-level doc (Javadoc 2 superset + RTF) can be invoked as part of 
your daily build. Deliver always up-to-date doc to your boss! 


Download Together Whiteboard at www-ogethersoft.com/ddj 


“Having seen Together’s direct use of Java code to support the models, | am forced 
to wonder why anyone would like to work in any other way! ...it's a complete solution, 
not just a modeling tool.” —Jon Collins, Senior Analyst, Bloor Research 


“We discovered in our evaluation that other modeling tools only get used during the 
front end of a project. Yet we found Together to be the one UML modeler that our 
developers will use from start to finish. It's the only one that delivers simultaneous 
round-trip engineering (rather than batch mode). Why?”——Curtis Chambers, The 


Home Depot 


“Thanks for your wonderful product! | wonder how you guys find the time to 
incorporate so many additional new features in the short span of time between 3.0 
and 4.0. | am, as a power user of your product, very happy to see your progress.” 


— Suresh Balamaran, TimeO 
©2000 TogetherSoft Corporation, all rights reserved. Not all features are available in all editions. 


Together, OneSource, BigPlay, and Raftmaker are trademarks of TogetherSoft Corporation. Java, JavaBean, and EJB 
are trademarks of Sun Microsystems, Inc. 
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FEATURES 


THE JAVA SECURE SOCKET EXTENSIONS 

by Kirby W. Angell 

The Java Secure Socket Extension package and Java Run-Time Environment provide most of the 
tools you need to implement SSL within Java applications. 


THE JUNIT++ TESTING TOOL 

by Siegfried Goeschl 

JUnit++ is a freely available Java unit test framework that includes a test data repository, 
command-line arguments, and a TestRunner class that supports a built-in repetition counter and 
multithreading at the command line. 


TAPE DEVICES & JAVA 
by Chad Gibbons 
Since native support for tape devices doesn’t exist in Java, Chad had to build his own tape library. 


J.A.D.E.: THE JAVA ADDITION TO THE DEFAULT ENVIRONMENT 

by Jean-Marie Dautelle 

The Java Addition to the Default Environment (or “J.A.D.E.” for short) is an open-source project 
that fills gaps in the JDK core library. 


A JAVA 2 NETWORK CLASS LOADER 

by Lorenzo Bettini and Donato Cappetta 

Lorenzo and Donato present NetworkClassLoader, a class loader that lets you load classes 
from remote servers. 


MOVING UP TO 64 BITS 
by Arch D. Robison 
The migration from 32-bit to 64-bit machines is really a test of how “clean” your code is. 





C# VERSUS JAVA 

by Marc Eaddy 

Six of one and half a dozen of the other? Marc finds out when he examines how Photography by Paul Haggard. Special 
Microsoft's C# differs from Java. thanks to Henry's on Eighth. 





EXPLORING PERL LIBRARIES 84 
by Robert Kiesling 
Among other things, the Perl modules Robert presents here let you view library module data 
Vad la) =} & Gs 1} within the Perl interpreter. 
As: JAVA AND THE WABA TOOLKIT 92 
by Al Williams 
. . With Java and the Waba toolkit, you can develop powerful applications for handhelds like the 

Hesgn Methoaoges PalmPilot or Windows CE-based systems. 

Deconstructing Microsoft’s INET 
INTERRUPT SCHEDULING 104 

XFree86 4.0.1 by John Pote 


This interrupt scheduler was designed to be flexible enough to accommodate more handlers if 
and when they’re needed, or if priorities change. 
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EMBEDDED SYSTEMS 


MODULE DESIGN GUIDELINES FOR REAL-TIME SYSTEMS. 108 


by David Janik 

A common problem faced by programming teams is producing consistent, 
reliable, and maintainable modules. David offers guidelines to address 
this issue. 


INTERNET PROGRAMMING 


CROSS-PLATFORM DHTML 120 


by Charlie Ma 
If browser incompatibilities are driving you batty, Charlie’s techniques for 
cross-platform DHTML development might save the day. 


PROGRAMMER'S TOOLCHEST 


SAX2: THE SIMPLE API FOR XML 130 


by Eldar A. Musayev 
SAX, the “Simple API for XML,” is an efficient and high-performance 
alternative to the Document Object Model. 


COLUMNS 


PROGRAMMING PARADIGMS 135 
by Michael Swaine 

So Java isn’t your cup of tea? Michael explores the alternatives. 

C PROGRAMMING 139 
by Al Stevens 


Al mulls over all the charges against Microsoft— and changes his mind 
about the whole mess. 


JAVA Q&A 145 
by Elisabeth Strunk 

Elisabeth builds a JNI wrapper that lets the Java Authentication and 

Authorization Service and Windows NT authenticate specific users. 


ALGORITHM ALLEY 15] 
by Thomas Gettys 

Thomas presents an efficient hash technique guaranteed to generate a 

perfect hash function for an arbitrary set of numeric search keys. 


DR. ECCO’S OMNIHEURIST CORNER 156 
by Dennis E. Shasha 

Up in the Alaskan tundra, oil and caribou mix about as well as 

oil and water. 


PROGRAMMER’S BOOKSHELF 159 
by Gregory V. Wilson 

Greg looks at a bunch of books, including Programming Ruby, Program 
Development in Java, The Interpretation of Object-Oriented Programming 
Languages, MMIXware: A RISC Computer for the Third Millennium, Essential 

XML, XML Processing with Python, Presenting C#, and Women in Computer 
Sciences: Closing the Gap in Higher Education. 
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What shape is your database project? A tiny embedded database that requires a small footprint but demands 
rigorous functionality? Or a huge multi-platform, multi-user, multi-neadache project? Have you struggled to find a 
better solution for your data access requirements, only to feel like you're sacrificing one requirement over another? 


FairCom has delivered uncompromising database technology to 
commercial developers for over twenty years, with the primary 
goal of keeping control in the hands of the developer. Our 


customers rely on the speed, flexibility, scalability and 


reliability of c-tree Plus. The high performance and low cost of 
ownership make c-tree Plus an excellent choice for all sizes 


of database development projects. 


Proven Database 
Technology 





Besides the ISAM-level control and speed, perhaps the most important 


reason why small development houses and Fortune 1000 
companies have chosen FairCom technology is 
the superior service offered by our sales and support staff. 


Our development staff understands the challenges you face, 
and we'll put our twenty years of experience to work to help 


you build better solutions. 


Single Solution for Diverse 


Implementation 
* Vertical Markets * E-Commerce 
* Embedded Systems * Smart-Cards 


* Web & ASP Markets * Web-Enabled Appliances 
Comprehensive Feature Set 


* Royalty-free single user and multi-user support with 
full source code 

* Client/server and custom server support 

* Robust server side SDK - build your own application 
specific database server with full source available! 

* Thread compliant + portable thread API 

Full featured transaction processing with savepoints, 

abort and full rollback 

Comprehensive security and encryption features 

Small footprint 

Fixed and variable length records and keys 

Store any data type - up to 18 million terabytes! 

Dynamic space reclamation 

ODBC and Crystal Reports™ drivers 

Full ISAM functionality 

Powerful stand-alone index support 


Multiple Platform Support 


2 *® &¢ ©  &@ & 


QUALITY*THROUGH*EXPERIENCE 
























VVAVVAV VE tT ialetelas Motel an)so(elt— 





DBMS Since 1979 * USA. 800.234.8180 °* info@faircom.com 


Other company and product names are registered trademarks or trademarks of their respective owners. 


6 


© 2000 FairCom Corporation 


Dr. Dobb’s Journal, February 2001 









SOFTWARE 
I’ () ‘Glia TOOLS FOR THE 
é PROFESSIONAL 


| eee Un |, WO PR | PROGRAILER 








PUBLISHER EDITOR-IN-CHIEF 
Timothy Trickett Jonathan Erickson 
EDITORIAL 

MANAGING EDITOR 

Deirdre Blake 


MANAGING EDITOR, DIGITAL MEDIA 
Kevin Carlson 

SENIOR EDITOR 

Nicholas Baran 

SENIOR EDITOR, DIGITAL MEDIA 
John Dorsey 

SENIOR PRODUCTION EDITOR 
Monica E. Berg 

ASSOCIATE EDITOR 

Amy Lincicum 

ASSOCIATE NEWS EDITOR 

Shannon Cochran 

EDITORIAL ASSISTANT 

Della Song 

ART DIRECTOR 

Margaret A. Anderson 

INTERNET BROADCAST PRODUCER 
Philippe Lourier 

INTERNET BROADCAST ASSISTANT 
Sean Horton 

SENIOR CONTRIBUTING EDITOR 

Al Stevens 

CONTRIBUTING EDITORS 

Tom Genereaux, Bruce Schneier, Ray Duncan, 
Jack Woebr, Jon Bentley, Dennis Shasha, Tim Kientzle, 
Gregory V. Wilson, Mark Nelson 
EDITOR-AT-LARGE 

Michael Swaine 

PRODUCTION 

Denise Denis, Chad Auger 


INTERNET OPERATIONS 

DIRECTOR 

Michael Calderon 

SENIOR WEB DEVELOPER 

Steve Goyette 

WEB DEVELOPER 

Bryan McCormick 

WEBMASTERS 

Sean Coady, Dana LaPoint, Joe Lucca, Randy Reames 


CIRCULATION 

DIRECTOR OF CIRCULATION 

Kathy Henry 

ASSISTANT DIRECTOR OF CIRCULATION 
Troy Smith 

CIRCULATION MANAGER 

Cherilyn Olmsted 


MARKETING/ADVERTISING 

ASSOCIATE PUBLISHER 

Brenner Fuller 

SALES DIRECTOR, WEST 

Stan Barnes 

MARKETING DIRECTOR 

Jessica Hamilton 

AUDIENCE DEVELOPMENT DIRECTOR 

Ron Cordek 

MARKETING MANAGER 

Marquita Tinio 

ACCOUNT MANAGERS see page 161 

Michael Beasley, Randy Byers, Lilly Gutierrez, 
Linda Guyette, Michael Kelleher, Cassandra Kogelschatz, 
Caroline Oidem, Gabriel Rogol, Chona Sommers, 
Mark Stone, Marla Wood 

GRAPHIC DESIGNER 

Carey Perez 


DR. DOBB’S JOURNAL 
2800 Campus Drive, San Mateo, CA 94403 
650-513-4300. http://www.ddj.com/ 


CMP, INC. 

Gary Marshall, PRESIDENT AND CEO 

John Russell, CORPORATE PRESIDENT AND COO 

John Day, CFO 

Adam K. Marder, GROUP PRESIDENT, BUSINESS 
TECHNOLOGY GROUP 

Regina Starr Ridley, GROUP PRESIDENT, SPECIALIZED 
TECHNOLOGIES GROUP 

Pam Watkins, GROUP PRESIDENT, CHANNEL GROUP 
Steve Weitzner, GROUP PRESIDENT, ELECTRONICS GROUP 
Bill Howard, SENIOR VICE PRESIDENT, GLOBAL SALES 
AND MARKETING 

Vittoria Borazio, SENIOR VICE PRESIDENT, BUSINESS 
DEVELOPMENT 

Peter Hutchinson, SENIOR VICE PRESIDENT, SOFTWARE 
STRATEGIES GROUP 

Peter Westerman, VICE PRESIDENT/GROUP PUBLISHER 


Estimated print run 170,000 





B. 





Printed in the 


USA VBPA 
hitp://www.ddj.com 


ABP 


American Buisness Press 











developers used 





Which of these 








"“DBTools.h++ Offering 
and Productivity For 








“@ 

va) 

. 

—. = 

o| O 

= © 

. ho 

C: aw O 

Cc Oo 

we 9° Oo 

Ww “aD 

5p BBD SY 

- O w 

oR O 
— 9 Oo 
ie 
a yw 
O'S 


Rogue Wave's 
DBTools.h++™ t) 










ime 


ibiner 


? 


Database Applications” and 


Joy. 
test3 


Direct Control o 


in DBTools.h++ Applications. 
der of the free t 


Open SQL 
as a remin 


SQL 
Just For registering, we'll send 


you a Rogue Wave car 


DBTools.h++ users en 


w 











= 


SOFTWARE 


ROGUE WAVE 





www.roguewave 
EUROPEAN HDQTS 
www.roguewave.nl 








EDITORIAL 





D 7 ith all the hot air in Florida swirling around the presidential race, news of another contest— 
C] a Vu KDE versus GNOME— almost slipped by. Butterfly ballots aside, KDE and GNOME are 
; windowing systems that lay to rest the common misperception that Linux is a command-line 
A / / Over A 2al n operating system with no appeal to Windows-weaned users. In truth, there are several graphical 
desktop environments for Linux, with KDE and GNOME leading the pack. 

On the surface, KDE and GNOME are two chads in the same pod. The K Desktop Environment 
(KDE) is an open-source project that includes a window manager, file manager, panel, control center, 
and the like (see http://www.kde.org/). Likewise, the GNU Network Object Model Environment 
(GNOME) is a free software project that provides a consistent interface, file manager, drag-and-drop, 
objects on the desktop, and so on (see http://www.gnome.org/). 

Both environments were developed to provide an easy-to-use desktop alternative to the command 
line. And both projects are supported by a full complement of applications, ranging from games and 
productivity tools, to development tools, such as GUI builders and scripting languages. Under the 
hood, however, there are some fundamental differences between the two. KDE is designed around 
the concept of interprocess communication and built using the Qt library. GNOME, on the other 

hand, is built using the GIK+ widget set and adopts a CORBA-like architecture. 
While there is perhaps some degree of compatibility between the two systems 
(including limited drag-and-drop), the ability to run an unmodified KDE 
application under GNOME can be difficult at best. And, according to both the 
KDE and GNOME project leaders, it is likely to stay that way. 

Frankly, there’s nothing really wrong with this lack of interoperabilty. For the 
most part, if a GNOME application developer wants a program to run on KDE, 
he just writes a KDE version of it. Right now there are hundreds of programs 
with either the prefix “g” (for GNOME or GTK+) or “k” (for KDE). Furthermore, 
: choice between desktop environments is a good thing for users, as is 
: competition between the two development groups. 

So, you'd think that two projects with similar goals and mindsets— and a 
common enemy— could at least get along. That doesn’t appear to be in the 
wind, however, as there’s every indication that proponents of KDE and GNOME are about to make 
the same mistakes as combatants in the UNIX versus UNIX wars of the 1980s. 

The opening salvo was fired by GNOME with the launch of the GNOME Foundation— a 
consortium whose mission is to “help set the technical direction of the GNOME project, promote the 
broad adoption of GNOME on Linux and UNIX desktops and offer a forum for industry leaders to 
contribute to GNOME.” The GNOME Foundation is backed by organizations such as Compaq, Eazel, 
Free Software Foundation, Gnumatic, Helix Code, Henzai, HP, IBM, OMG, Red Hat, Sun, TurboLinux, 
and VA Linux. The Foundation will be directed by an advisory board initially consisting of 
representatives from the corporations and organizations just mentioned. In short, the Foundation will 
“provide organizational, financial and legal support to the GNOME project, and help determine its 
vision and roadmap.” 

Not to be outdone, the KDE folks fired back with the KDE League, a consortium whose 
mission is to facilitate “the promotion, distribution and development of KDE, with the goal of 
establishing KDE as a desktop standard for PCs, workstations and mobile devices.” In this case, 
the founding members include the likes of Borland, Caldera, Compaq, Corel, Fujitsu-Siemens, HP, 
IBM, KDE.com, Klardlvdalens Datakonsult, thekompany.com, Mandrakesoft, SuSE, Trolltech, and 
TurboLinux. Governance of this consortium is controlled by developers and corporate sponsors 
who share power equally. 

Jumpin’ Jehoshaphat, this sounds like déja vu all over again! If you recall, back in the 1980s — 
before Windows owned the desktop —UNIX was posed to take a run at dominance. Hoping to see a 
single unfettered standard emerge, AT&T had stepped back from its draconian hold on the OS, 
enabling Sun, the Hamilton Group, the University of California BSD folks, and others to quickly fill 
the void. But what emerged from all this was, on one hand, the Open Software Foundation 
(consisting of IBM, DEC, HP, Silicon Graphics, and the like), and on the other, UNIX System V 
(backed by AT&T, Sun, Unisys, Motorola, Toshiba, and so on). The end result was confusion— not 
unity— and the door opened for Windows to dominate the desktop. 

No one is saying that KDE and GNOME need to get all touchy feely and warm and fuzzy with 
each other. Nor, for that matter, should either project develop reactively to the other. But as KDE and 
GNOME look to the future, they might also consider the past and try not to repeat the same mistakes 
of UNIXes of a generation ago. After all Toto, we’re not in Florida anymore. 


Sldicn. 


Jonathan Erickson 
editor-in-chief 
jerickson@ddj.com 
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LETTERS 





Analyzing Algorithms 

Dear DDJ, 

In the January 2000 “Letters,” Todd Stephan 
claims that you can leave out the sqrt when 
solving the traveling salesman problem. Un- 
fortunately, this isn’t true. Here’s why. In 
effect, by saying the sqrt is redundant, you 


are suggesting that a\2+bA2>cA2+dA2=> — 


at+b>c+d (i.e., if the sum of the distances 
squared is greater, then so is the sum of 
the distances). But consider a=1, D=2.7, 
c=d=2. a\2+bA2=8.29 and cA2+dA2=8. 


8.29>8. So far, so good. But 1+2.7=3.7< 


2+2=4. Oops! 
Ben Laurie 
ben@algroup.co.uk 


Child’s Play 

Dear DDJ, 

In regard to Michael Swaine’s “Program- 
ming Paradigms” column entitled “Child- 
hood’s End” (DD/, November 2000), I 
agree with the idea that the children of 
tomorrow will adapt to an entirely new 
world through sophisticated interactive 
toys, leaving us nonchildren far behind. 
This is somewhat evident, even today. The 
sophisticated CAD software of the previ- 
ous decade has become the “Crayola 3D 
Castle Creator” that my preschool daugh- 
ters fool around with on the computer (ac- 
tually, I play with it more than they do— 
perhaps there is hope for nonchildren after 
all). But no matter how intricate or life- 


like the toys of tomorrow may be, I can- 


not help but believe that the children of 


the future would still find more interest- | 


ing things to do with the cardboard box 
the toys are delivered in. Some things will 
never change. 

James Metzger 

j.t.metzger@ieee.org 


Hurd, Hurd, Hurd — 

Hurd Is the Word 

Dear DDJ, 

Jerry Epplin’s excellent article on “The 
HURD” (DDJ, December 2000) seems to be 


missing one point that I think is important. 
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He mentions that there are a number of 
“toy” microkernel-based operating systems 
that have been released over the years, but 
he doesn’t mention the other large-scale 
microkernel OS projects, which are more 
similar to the HURD. Most Mach-based OSs 
included a complete BSD engine, and in 
that respect, the HURD and these other OSs 
are similar in that they all run UNIX. The 
difference is an important one though. 

In the past, the kernel was seen as the 
only interesting area for research, likely as 
a side effect of it being a highly academic 
effort (at least to start). Although the goal 
was to help OS development, the benefits 
a microkernel offered to OS developers 
were unrealized unless the OS was modu- 
larized. But that was largely seen as work 
for someone else to do, and in the end, no 
one ever seemed to get around to it. 

UNIX was ported with just enough 
changes to get it running, because it was 
being included largely for utility value to 
support kernel development. Over a decade 
later and UNIX on Mach is largely as mono- 
lithic as the original UNIX that spawned 
them— perhaps even more so due to a lack 
of maintenance. To illustrate this point con- 
sider MkLinux, which was a port of Linux 
in essentially unchanged form so it would 
run on the Mk kernel. There’s no advan- 
tage to this version, and it runs slower too. 

The reason I find the HURD com- 
pelling is that it’s the first concerted ef- 


_ fort to close the loop and actually fix the 


OS itself. In the HURD the kernel is of 
secondary importance, the modulariza- 
tion of the OS itself is the key reason the 
project exists. It’s only with the HURD 
that the advantages of the microkernel 
Jerry outlines in his article will become 
fully realized. 

Maury Markowitz 

maury@fintech.com 


Software in the 21st Century 

Dear DDJ, 

In regards to Eugene Kim’s article “The 
Future of Programming” (Dr. Dobb’s Spe- 
cial Report on Software in the 21st Cen- 
tury” (December 2000): The problem is 
not that programmers on [the] whole are 
lazy, impatient, and arrogant— it is that 
there are increasingly more mediocre pro- 
grammers. To imply that all programmers 
fit into this derogatory species is both in- 
sulting and erroneous. Subsequent con- 
clusions based on erroneous assumptions 


are also erroneous. 


Rather than relying exclusively on Larry 
Wall’s view of programmers, readers may 
want to consult a leading expert in the field, 
Richard W. Hamming, in particular his 
chapter “How To Think About Trends” in 
Beyond Calculation: The Next Fifty Years 
of Computing (ISBN 0-387-94932-1). 


Dr. Dobb’s Journal, February 2001 


Anybody can program a computer. With 
today’s visual programming, rapid appli- 
cation development, wizards, and the nu- 
merous scripting languages (like Perl), 
more and more people are becoming 
mediocre programmers. Because you can 
program a computer does not mean you 
should do it for a living. 

While [it] is true that “tools and good 
practices will make programming easier,” 
Hamming suggests, “until we have lan- 
guages that help us think about the orig- 
inal problem and its proposed algorithms, 
there will be only slow improvements in 
programming effort.” 

Edward Harned 

ed@coopsoft.com 


Analyzing Analytic Computing 

Dear DDJ, 

Laurent Bernardin’s article on “Analytic 
Computing” (DDJ, September 2000) gives 
an interesting overview of the synthesis 
of numerical and symbolic computation 
using Maple. 

Regarding his first example, I recently 
became aware of a different recursive 
computation for terms in the Fibonacci 
sequence. (This was shared by a col- 
league, so'I don’t have a reference for the 
description below. I did locate a different 
derivation of essentially the same result— 
where else but in Donald Knuth’s Art of 
Computer Programming Vol. 1). 

The recursion is defined by: 


F(O) = 0; 
FC) = 1; 
F(2) = 1; 


/ 
| F((n+1)/2)A2+F(n-1)/2)A2_ if n is odd 
F(n)=/ : 


\ 
| F(n/2+1)A2—F(n/2-1)A2 if nis even 
\ | 


This recursion has depth proportional 
to log_2(w rather than n. Whereas the it- 
erative computation of F(100000) takes al- 
most exactly 45 seconds on a 650-MHz 
Pentium-III Gust as Bernardin predicts), 
the recursive computation above takes 
about 15 seconds, even without “options 
remember.” If options remember is used, 
F(100000) is computed almost instanta- 
neously. 

To see why this works, you can first 
derive that the mth Fibonacci term (suit- 
ably reindexed) counts the number of bi- 
nary strings of length n with no repeat- 
ing ones. To see this, first observe that 
there is one such string of length zero 
(the empty string has no consecutive 
ones) and two such strings of length one 
(0 and 1). Now suppose that the result 
holds for strings of length k or less. You 
can extend a string of length k by adding 
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A Lesson from the 
Auto Industry 
by Adam Kolawa 
Focusing on error prevention methods such as 
enforcing coding standards decreases the number of 


bugs you have to handle in the later stages of 
development, where bugs are often more difficult and 


costly to fix. To see how potent error prevention can 


be, consider the auto industry. 

During the 1970s many car makers were plagued 
with defects. Instead of examining the manufacturing 
process itself, American car makers focused their 
efforts on inspecting the cars as they came off the 


production line, and fixing the defects found. On the | 


other hand, the Japanese car makers improved the 
Droduction processes that caused defects by takings 
Steps to reduce the risk of producing faulty cars. They 
Spent less time finding and fixing defects after the 
cars were made. The quality of their cars was higher 
and they outsold their American competitors. 

So what does this have to do with software? The 
current practice in software development resembles 
the unsuccessful American strategy: the software is 
written as quickly as possible, then, once the 
application is done, it's tested and debugged. Clearly, 
the Japanese method is the one to use. 

One way to apply that successful strategy in 
Software development is through coding standards 
enforcement. Coding standards are language-specific 
"rules" that, if followed, will significantly reduce the 


onportunity for developers to introduce errors into an | 


application. For example, 


class FOO { 
private: 
char *eptr; 
public: 
FOO(char *x) { 
cotr=strdup (x) ;//ignore null case here 
} 
virtual ~FOO( ) { 
1f(cptr) free(cptr) ; 


+3 
Since no copy constructor is defined in this C++ 


code, the compiler will copy the class using a default 
copy constructor, and the pointer will be copied to a | 


new class. The outcome will only be correct if a new 
memory location was not supposed to be created. The 
best way to prevent this potential error is to enforce a 
coding standard that will remind you to write a copy 
constructor if you have any pointers in your class, 
reducing the risk of an incorrect outcome. 


Adam Kolawa, Ph.D., is Chairman and CEO of 
ParaSoft. You can reach him at ak@parasoft.com 


Parasoft’ 


CodeWizard® prevents C/C++ errors 
and slashes your debugging time 


"CodeWizard is an excellent tool. It has helped me spot errors in my software, and it's made 
me a better programmer." Robert Morgan, Captivation Digital Laboratories, Inc. 


CodeWizard® is a tool that helps you prevent errors by 
automatically performing static analysis on C/C++ 
code. The first thing you'll notice about CodeWizard is 
that it cuts down on the thing you hate the most: 
debugging. Preventing errors with CodeWizard is 
painless and saves you time in the long run. 
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CodeWizard enforces over 120 sophisticated 
C/C+ coding standards. 


CodeWizard automatically enforces a sophisticated set 
of C/C++ coding standards. These language-specific 
rules are designed by industry experts such as Scott 
Meyers to help you avoid dangerous coding 
constructs. CodeWizard parses your code, clearly 
displays violations of the coding standards, and 

gives you all the information you need to make 
repairs, including suggestions on better coding 
constructs. Follow CodeWizard's coding 
Suggestions, and you'll immediately begin to 

write cleaner, more reliable code. 


The key to CodeWizard is that you can 
customize the program to fit your 
development schedule. With a few 
clicks, you can tell CodeWizard to 
enforce only the standards that are 


most important to you at a particular stage of your 
project. Using the RuleWizard™ feature, you can 
point and click to build your own sophisticated 
coding standards. RuleWizard guides you through 
the coding standard building process and lets you 
know when your coding standard is ready for 
enforcement by CodeWizard. 


You won't have any problem adding CodeWizard to 
your arsenal of development tools, because it installs 
directly into Windows and UNIX development 
environments. In Microsoft Developer Studio®, you'll 
begin testing with a single click of the CodeWizard 
icon. On UNIX platforms, CodeWizard is a wrapper 
around the compiler. Learn a few simple commands, 
and you'll be on your way to cleaner code. 


If you would like to prevent errors with a 
customizable tool that conforms to your development 
cycle, download CodeWizard today at 
www.parasoft.com/ddj2, or call (888) 305-0041 for 
more information. For a special incentive, download a 
demo this month. 


Download your 


FREE DEMO at: 


ParaSoft’ www. parasoft.com/ddj2 





© 2001 ParaSoft Corporation, Monrovia, CA, USA. ParaSoft and CodeWizard are registered trademarks of ParaSoft Corporation and RuleWizard 
is a trademark of Parasoft Corporation. Microsoft Developer Studio is a registered trademark of Microsoft in the USA and other countries. 





(continued from page 10) 

a bit on the left end. How many strings 
of length k+1 with no consecutive ones 
are there? 

If the new left-most bit is 0, than the rest 
of the string can be any of the strings of 
length k. If the new left-most bit is 1, then 
the following bit must be 0 and the re- 
maining bits can be any of the strings of 
length k-2. Thus, the total number of strings 
of length k+1 is F(R+1) = F(R) +F(R-1) — 
the Fibonacci sequence. 

Now consider a different way of com- 
puting the count of such strings. Consider 
odd-length strings of length 2k+1. You can 
count the number of strings without con- 
secutive ones as follows: Look at the mid- 
dle bit. If this bit is 0, then you can con- 
struct acceptable strings using any 
acceptable string of length k on the left and 
any acceptable string of length k on the 
right. If the middle bit is 1 then the bits on 
either side must be 0, and the remaining 
bits can be taken from any pair of accept- 
able strings of length k—-1. So the total num- 
ber of such strings is F2R+1)=F(R)A2+ 
F(k-1)A2. For even-length strings, you take 
one of the two middle bits and follow the 
same logic to get F(2k)=F(k) F(k-1)+ 
F(k-1) *F(k-2). Observing that F(k-1)= 
F(k)— F(k—-2), you see that: 
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F(2k)=F(k)*[F(k)-Fk—2)]+1F(k)-Fk—2)]*F(k—2) 
=F(k)A2—F(k—2)A2. 


Renaming the indices appropriately gives 
the formula above. 

This idea illustrates that recursive com- 
putation can be much more efficient than 
iterative computation. The trick is to use 
the recursive viewpoint to find ways to 
divide the work of computing a particu- 
lar result into subtasks that are different 
from those that come to mind when one 
thinks about iterating. Other results in 
the same vein include Karatsuba’s method 
for multiplying polynomials, Strassen’s 
method for multiplying matrices, and the 
Fast Fourier Transform. 

Matthew Saltzman 

mjs@clemson.edu 


Giving Boost a Boost 

Dear DD], 

I thoroughly enjoyed the concepts of 
Boost (see “C++ Type Traits,” by John 
Maddock and Steve Cleary, DD/, Octo- 
ber 2000) and took particular pleasure in 
grappling with how the implementation 
actually worked. Unfortunately, I now 
need to go and have a good lie down. I 
suggest that I was lucky to survive, and 
that a safety warning be inserted into the 


Dr. Dobb’s Journal, February 2001 


source code: “These templates were cre- 
ated by trained professionals in controlled 
conditions. Do not try this at home.” 
Claude Brown 
claude@earthling.net 


MP3 Notes 
Dear DD], 
In regards to Al Stevens’s “Into the World 
of MP3” article: I have never had any MP3 
software crash on me in Linux. The list of 
MP3 toys I have played with is long: 
mpg123, xmms, bladeenc, lame, grip, and 
so on. I have lots of CDs and have almost 
converted them all to MP3. I just load one 
in, select all tracks, and hit the rip+encode 
button. When it’s done, it opens the CD 
drawer, I switch disks, lather, rinse, and 
repeat. Occasionally, the disk is not in the 
CDDB database, and I have to manually 
enter disk/track/artist information. But I 
do not ever remember having the software 
crash. I’m probably lucky, but...Glad I’m 
not using Windows! Thanks for the infor- 
mative article. 

Robert Wuest 

rwuest@wuest.org 
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Software Developers 


HASP'4 - Turn Your Lost 
Software Into Profits 















HASP4 


The Professional Software 
Protection System 





Identification Card 
Turn your lost software into profits. By preventing unauthorized 
software usage and duplication, HASP4 protects intellectual property 
rights and increases revenues. You will be able to reduce your 


Mission: To protect your software & Increase your revenues 
HASP is: A small hardware key that connects to the 


software prices, allowing your customers to reduce their prices too. computer and prevents unauthorized use 
It enforces the legitimate use and licensing of software applications. pe 

— Features & Prevents software piracy 
HASP4 controls software distribution and expands sales channels. Benefits Controls software distribution 
Companies such as 3M, Lucent, HP, Cadlink and Scitex know that Protects intellectual property 


Increases revenues 
Manages network licensing 
Reduces software distribution costs 


they can count on HASP to protect their software products. 







Order your HASP4 Developer's Kit at: 
www.eAladdin.com/DDJ 
Or Call 800-223-4277 EST, & 800-562-2543 PST, CST, MST. 
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100+ pages of products 
and information 
for Developers! 


Get a FREE Subscription 
to our catalog by calling 







or subscribe at 









DevTrack 
by TechExcel 











DevTrack is the 
premiere defect- 
and project-tracking 
tool for software 
development teams, helping 
to ensure that development projects finish on 
time and on budget. DevTrack comprehensively 
tracks and manages all defects, change/feature 
requests, and all other development issues. 
DevTrack also provides powerful workflow and 
process automation features, robust searching 
and reporting, and comprehensive point-and- 
click customization. Intuitive and powerful, 
DevTrack provides a scalable out-of-the-box 
solution, at a great value. 


Benefits Include: 

e Improved efficiency 

e Synchronized releases 

e Better project management 
e Higher software quality 
e Reduced development costs 
e Greater customer satisfaction. 
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Now you can easily create dazzling, 
lightning fast charts for Active Server 
Page (ASP) applications with True 
WebChart 6.0, which has Olectra 
Chart built in! With True WebChart, 
you can serve your Olectra charts 
as JPEG or PNG to any browser. Or 
serve those charts as an Activex 
control to allow your end-users to 
change chart types and data values! 
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by Seapine Software 


TestIrack 3.0 is the most power- 
ful defect tracking solution for 


Testi rack 





feature-rich solution to any user 


eb sel ha including: defect tracking, feature request, 
S96 01 50-FH customer & user test configurations, email 


notification, email bug importing, duplicate 
defect-handling, release note generation, 
and unlimited customer care. Distribute 
SoloBug, TestTrack’s defect reporter, to 
automate your customer support system. 
TestTrack provides your team with the 
most powerful defect tracking system. 


VForm 3.0 Form/Grid MFC Control 
by TCK Software 


Professional Ed. 
Paradise No. 
$96 0151-EH 


Use the GUI Builder to easily create 
incredible Forms, Grids and Reports, 
or dynamically build them in your code. 
An entire UI in a box with complete 
flexibility and control that is incredibly 
easy to use. No other control even 
comes close! 


Dynamically create full featured grids 
from just an SQL statement. 


Paradise No. 
T40 0130-EH 


Version 3.0 fully supports OLE DB, & ADO. 


jot) oe Sizer 2000 
[QZ by FMS Inc. 


Total Components Sizer 6 is a 
package of two ActiveX controls for 
Visual Basic developers. The Sizer 
controls provide form resizing and 
splitter bar functionality. The Resizer 
control is a fully functional resizer 
control, that when placed on a form, 
automatically resizes the form’s 
controls as the form is resized. The 
Splitter control is a container-based 
object that provides a splitter bar that 
allows two panes to be sized in a 
horizontal or vertical orientation. 
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True WebChart™ 6.0 and VS Combo Pak™ 7.0 
by ComponentOne, formed by the merger of APEX and VideoSoft 


Testirack for Web & Windows 


}| Windows. TestTrack 3.0 delivers a 


group by providing concise tracking abilities 


















visit us at programmersparadise.com 


or call 800-445-7899 








The Combo Pak combines two of 
the industry's most popular, award- 
winning products—VSFlexGrid Pro 
7.0, a data-aware grid control; and 
VSVIEW 7.0, a powerful printing 
and print previewing engine—in a 
single package. 
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Multi-Edit 9 


by American Cybernetics 


Multi-Edit 9, the professional 
programmer's text editor is now 
packed with more features than ever 
before. New features include: Visual 
Studio Integration; ColdFusion 
Integration; improved Project Manager; 
language support for Python; SQL; 
Clarion; REBOL; and more. 
Experience enhanced PERL support 
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sacl eomwale Performance Tools 
by Intel 


e VTune™ Performance Analyzer non- 
intrusive sampling and call graph 
profiling offer multiple ways to 
understand code performance. 

e Intel? C++ Compiler* designed from 
the ground up to take full advantage 


gi Sold nee 
Call for pricing 
on compilers. 


VTune Performance 


Analyzer of Intel’s latest processors. 
Paradise No. e Intel® Fortran Compiler* delivers 
123 0142-EH 


outstanding application performance 
with advanced features like Profile- 
Guided Optimization. 


dtSearch Web 


Add instant searching to your site! 
e Over a dozen text search options 


e Highlights hits in XML, HTML & PDF 
while keeping links & images intact 


e Converts word processor, database, 
spreadsheet, ZIP, etc. to HTML for 
display with highlighted hits 

e Point & click setup 

e Includes dtSearch Desktop with Spider. 


Per Server Ask about: dtSearch Network & dtSearch 
Paradise No. Text Retrieval Engine with sample code in 
D29 0530-EH multiple programming languages. “A 


tremendously powerful and capable text 
search engine”—Visual Developer. 
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Tremendous Selection! 


Personal Service! 


SPF/SourceEdit 3.5 


by Command Technology 


SPF/SE provides file 
management and editing similar 
| to ISPF for Win95/98/2000/NT 
| Features include: Variable or fixed length 
| records; Record lengths to 64,000; File 
size to 100M; ASCII or EBCDIC character 
set; Multiple line numbering modes; 
Hex display/modify; Source colorization; 
Powerful file lists; Projects; “C” macro 
language; and much more. 


Help Development 
Buy now and get: 


program—$99 Value 
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Download a DEMO Total Retail Value—$1,098 
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LEADTOOLS Imaging 


by LEAD Technologies, Inc. 

LEAD’s award-winning imaging 
technology is chosen by Microsoft, 
Hewlett Packard, Intel, Boeing, Xerox, 
and thousands of other companies for 
use in their high-volume applications 
and internal systems. LEADTOOLS gives 
developers the most flexible and powerful 
imaging technology available offering 
imaging technology in 19 categories 
including: import/export; compression; 
image processing; color conversion; 
display/special effects; Internet/intranet; 
annotations; medical imaging; OCR; 
barcode recognition; and much more! 


Visual SlickEdit® v6.0 


by MicroEdge, Inc. 


MicroEdge, Inc. announces Visual 
SlickEdit v6.0! Enhanced Context 
Tagging™ for C++, Java, HTML, JSP, 
PHP, JavaScript and more. Auto List 
Members and Auto Parameter Info 
window lists now sizeable. New Java 
tool integration. Enhanced DIFFzilla®. 
New JavaBean support and Multiple 
Build Configurations. 
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Download a 
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c-tree Plus® 
by FairCom 
1 Pkg supports 25+ 0/S 


Windows Linux ¢ Mac e Novell ¢ 
Lynx ¢ QNX @ AIX @ FreeBSD © HP UX @ 
SGI ¢ Solaris ¢ Tru64 ¢ Many more! 

This well-known data management 
package is the tool of choice for serious 
database developers. Low-level and high- 
level ISAM functions offer an unprecedent- 
ed level of control. Build fast, royalty-free 
single-user and multi-user applications, 
or utilize the optional FairCom database 
Server. ODBC and Seagate™ Crystal 
Reports drivers available. Small footprint, 
cross-platform support, full source code. 
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by eHelp Corporation 
The Industry Standard in 


e RoboHelp Office 9-$899 Value 
e FREE NetStudio—Web graphics 


e FREE FullShot 6.0—Screen 
capture program—$80 Value 

e FREE WebHelp Resource Kit— 
WebHelp Guide & CD- 


Limited inventory—While supplies last! $840.°° 
























GUARANTEED BEST PRICES* 


_J\ Should you see one of these products listed at 

| a lower price in another ad in this magazine, [Ze 

41 CALL US! We'll match the price, and still offer [se 

>| our same quality service and support! 

“Terms of the offer: 

e Offer good through February 28, 2001 

e Applicable to pricing on current 
versions of software listed 

e February issue prices only 

e Offer does not apply towards 
obvious errors in competitors’ ads 

e Subject to same terms and conditions 

Prices subject to change. 

Not —_ for se errors. 
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$20 Value 


FarPoint’s Spread 3 
by FarPoint Technologies 


Use FarPoint’s Spread 3 to create powerful 
database front-ends, manage the display 
and entry of two billion items using the 
enhanced Virtual Mode, print reports 
using Print Preview, perform calculations, 
import/export Excel 97 files (32-bit), 
export HTML files (32-bit), sort data, 
support OLE Drag and Drop, or take 
advantage of its unparalleled cell-level 
formatting, including twelve built-in cell 
types. Unmatched power, flexibility, and 
speed combine to make Spread the 
perfect component for any application. 


PR-Tracker™ 


by Softwise Company 

Bug Tracking...doesn't 

have to be expensive! 

Integrate bug tracking over the 
Internet and a local area network in 
one database with the same easy-to- 
use interface. 


PR-Tracker’s simple yet powerful 
interface requires no training and 
enables you to configure a project in 
about 15 minutes. Features include: 
customizable data entry, workflow, 
views and queries; one click queries; 
attachments and email notification. 


eWasp Bar Code 


Web Component 


by Informatics 

The Leading Bar Code 

Web Developer’s Toolkit 
eWasp Web Components allows you 
to easily create bar codes that can be 
printed from your client’s browser. 
Enhance your website with bar codes 
by creating on-line coupons, tickets, 
check-in confirmations, license 
registrations, rebate mailers, RMA 
packing slips, invoices, electronic 
gift certificates, and much more. 
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Infragistics 
United We Stand 


You know who we are. Because you know who we were. Sheridan Software Systems and 





ProtoView Development have joined together in January 2001, as a powerful, new force in 
the component industry. Both have been recognized leaders in the software component 


market since its beginning, with over 185 industry awards combined. 


Infragistics’ diverse product line for COM and Java environments is used worldwide by 
nearly every Fortune 2000 company, over 85,000 corporations and 187,000 independent 
developers and consultants. Additionally, we offer services to help corporate clients gain 
the most from their investment through training, mentoring and implementation 
assistance. And soon we'll introduce a series of components written in C# for the 


upcoming Microsoft .NET environment. 
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It’s a brave new world. Now get the best of both worlds with Infragistics’ 





| newest, most comprehensive offering - UltraSuite™. We’ve combined our best- 


of-breed products into one positively jam-packed solution. Along with the 
| familiar, easy-to-use interfaces of Microsoft® Outlook, Office and the Windows 
Explorer, we offer the most innovative grid available - UltraGrid™, and our 


| UltraToolBars™ is loaded with functionality. With 45 controls included, you'll 





| have everything you need to create solutions that look great and run efficiently, 
| and deliver them faster than ever before. There’s nothing like the productivity, 


| flexibility and support of UltraSuite. 
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Cofounder of Fairchild Finally, Phomas Kilbum: working at_ Dreams of a vital Ghesevatory 
Semiconductor Dies the University of Manchester in the Jim Gray, manager of Microsoft's Bay 


Victor Grinich, one of the “Traitorous 
Eight’— the eight defectors from Shock- 


ley Semiconductor who founded 


Fairchild Semiconductor in 1957— has 
died at the age of 75. Grinich was the 
only electrical engineer among the eight 
scientists who founded Fairchild Semi- 
-conductor (the others were physicists 
and materials scientists). — 

Born to Croatian. immigrant parents, 
Grinich was an expert in integrated cir- 
cuits and, after leaving Fairchild, taught 
at Stanford University and the Univer- 
sity of California at Berkeley and was 


the author of the widely used college 


~ textbook, Introduction to Integrated 
Circuits (ISBN 0070248753). Among the 
traitorous eight were Robert Noyce, 


Gordon Moore (the cofounders of In- | 


tel), Sheldon Roberts, Eugene Kliner, 
_ Victor Grinich, Julius Blank, Jean Ho- 
erni, and Jay Last. Led by Robert Noyce, 
these scientists left Nobel Prize winner 
William Shockley’s company due to 
Shockley’s notoriously harsh and un- 
predictable management style (as well 
as his controversial views about race 
and intelligence) and founded Fairchild 
Semiconductor, which is credited with 
building the first mass-produced inte- 
hts circuit. 


: Gommputer History Museum 

Nominates New Fellows 
Frances Allen, Vinton Cerf, and Thomas 
Kilburn have been named Fellows of the 
Computer History Museum Center 
(http://www.computerhistory.org/). 

Frances Allen, a pioneer in the field 
of optimizing compilers, helped create 
one of the first automatic debuggers as 
_ well as the Alpha code-breaking lan- 
guage. She is currently a consultant to 
IBM’s Solutions and Services division, 
and in 1989, became the first woman to 
be named an IBM Fellow. 

Vinton Cerf, together with Robert 
Kahn, developed TCP/IP in 1973. He 
founded The Internet Society in 1992 


and served as its president for three 


years. He is a senior vice president at 
MCI/Worldcom and recently joined a 
team of engineers at NASA, charged with 
developing a wireless communications 
network for enabling Internet access in 
outer si 
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1940s and ’50s, is credited with many 


computer “firsts,” including the Williams 
Tube (the first random access electronic 
storage device); Baby (the first working 
stored- program computer); the ATLAS 
machine (which pioneered new con- 
cepts such as paging, virtual memory, 


and multiprogramming), and in 1972, 
the MU5 programming language. 


Speaking via video link from Man- 
chester, Kilburn said that he especially 


valued the museum’s award “because 
-] believe it has been suggested by 


peers in the States.” 


‘High Sctinolats Win 


Scholarships With New Algorithm 
Competing in the southwest regional 


round of the Siemens Westinghouse Sci- 
ence and Technology Competition, two 


high school students from Saratoga High 
School in California won the team com- 
petition by developing a new lossless 
data compression algorithm called 
“LZC&L.” Allan Chu, a sophomore, and 
Chun-Chieh Lin, a senior, came up with 
an enhanced and faster version of the 
LZ77 compression algorithm in wide use 
today. 

According to Martin Wong, one of 
the judges of the competition and a 
computer science professor at the Uni- 
versity of Texas, the new compression 
algorithm is based on two new tech- 
niques: the “composite fixed-variable- 
length coding” and the “offset- difference 
coding.” The composite coding tech- 
nique unifies fixed-length coding and 
variable-length coding into a single 
coding scheme. The offset- difference 
coding technique removes an inher- 
ent redundancy in previous coding 
schemes and hence can produce small- 
er compressed files. Experimental re- 
sults on a set of standard benchmark 


test problems show that the new al- 


gorithm LZC&L outperforms published 
compression algorithms in the LZ77 
family. 

The Siemens Westinghouse competi- 
tion is open to individuals and teams of 
high school students who develop in- 
dependent research projects in the phys- 
ical or biological sciences, or mathe- 
matics. For more information, see http:// 
www.siemens-foundation.org/. 
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Area Research Center and senior re- 
searcher in the Scalable Servers Research 
Group, operates the TerraServer web 
site, a public database of high-— 


resolution U.S. Geological Survey aeri- _ 


al imagery and topographical maps 
(http://terraserver.microsoft.net/). With — 
20 terabytes of disk storage, the Terra- 
Server is one of the world’s largest on- 
line databases. But the world is no 
longer enough. Gray has now set his 
sights on the skies, and is working with 
the astronomy community to build an 
international “virtual observatory.” 
In a presentation given at Stanford 
University, Gray estimated that all ex- 
isting astronomy data sets would add 
up to a few petabytes— a size he calls 
“huge, but not impossible.” The stor-. 
age challenge is not so much one of | 
memory, but of access. It would take 
about five and a half years to restore 
a petabyte database from tape. So Gray 
wants to do away with tape, store the 
data on mirrored disks at a federation 
of observatories, and have each ob- 
servatory also mirror another site, so 
that ultimately the entire database 
would be stored online four times 
over. Automatic parallelism would then 
quadruple the read bandwidth. Be- 
cause astronomers are unusually gen- 


erous about sharing data, Gray is op- | 


timistic that his dream will become a 
reality. 


Examples of Fair Use Wanted 
The Electronic Frontier Foundation 


(http://www.eff-org/) is “looking for — 


diverse, real-world examples of the 
ways in which the lives of ordinary fair 
users are strangled by the anticircum- 
vention provision of the Digital Mil- 
lennium Copyright Act (DMCA).” The 
recent ruling against Eric Corley, who 
was ordered to take down DeCSS, 
makes it illegal to circumvent access 
controls, regardless of the reason. Ac- 
cording to the EFF, Section 1201 of the 
DMCA violates the First Amendment 
right to Fair Use, and is hoping to find 
solid examples of how Fair Use applies — 
to digital copying. “Points will be giv- _ 
en for brevity, concreteness, and the 
ability to have your grandmother eas- 
ily ee the Pau 
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Less effort. Fewer bugs. The first time. 


It’s one thing to build a setup. It’s another to have it work correctly. But you don’t want the 
hassles of checking every five minutes to see if your project measures up to where it’s 
supposed to be. What a pain. Fortunately, you don’t have to feel it. 


With InstallShield Professional — Windows Installer Edition, you can create projects that debug and mea- 
sure the success of your setup as you build. Hot new features like Source Code Control 
Integration and the Dynamic Scanning Wizard keep you on the right track. And you can use 

the new MSI Debugger and Build Reports to know your setup is being properly built. 

The first time, every time. 


Don’t spend all that time to create an installation only to find out at the end that it isn’t 
perfect. Choose the setup authoring solution that measures your progress for you. 
Choose InstallShield Professional — Windows Installer Edition. 


SEE HOW YOUR PROJECT CAN MEASURE UP ° 
AT WWW.INSTALLSHIELD.COM InstallShield. 
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Authenticating and encrypting connections 
Kirby W. Angell 


Ou sit at your computer, marveling at your distributed Java 

application. Your code creates Socket and ServerSocket ob- 

jects like crazy, sending data across the Internet. It is a 

sight to behold— until you realize that anyone can inter- 
cept the data being read, masquerade as one of your applica- 
tions, and fill your system with bogus data. 

As soon as you start looking into authenticating and encrypt- 
ing the connections between applications, you find that you have 
entered a complex area. When dealing with encryption, you have 
to worry about many things— not just which algorithm you plan 
to use. Attacks to your system can involve the algorithm, proto- 
col, passwords, and other factors you might not even consider. 

Luckily, most of the messy details of authenticating and en- 
crypting traffic between two socket-based applications have 
been worked out in the Secure Sockets Layer (SSL) specifica- 
tion. Sun Microsystems has an implementation of SSL in its Java 
Secure Socket Extension (JSSE; http://java.sun.com/security/) 
package. JSSE and the Java Run-Time Environment (JRE) pro- 
vide most of the tools necessary to implement SSL within your 


Kirby is Chief Product Architect for Apigent Solutions and a con- 


tributing author to The Quick Python Book (Manning, 1999). 
He can be reached at kirbyangell@hotmail.com. 
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Java application if your Java app is a client communicating with 
HTTPS servers. Since the JSSE documentation and tools are most- 
ly geared toward this end, it takes some work to figure out how 
to use the toolset within an application where you need to cre- 
ate both the client and server sides of the connection. Conse- 
quently, in this article, P’'ll create a Java client and server that use 
the JSSE package to securely authenticate and encrypt traffic be- 
tween them. In truth, the code is actually the simplest thing you 
will have to do to get your connections encrypted. The devil is 
in the key management. 

The sample application (available electronically; see “Resource 
Center,” page 5) implements a client/server-based credit-card 
authorization system. The server accepts credit-card transactions 
from any number of remote clients. Each application can be 
configured to use a standard socket or to authorize and encrypt 
sessions with an SSL socket. The SSL analogs of Socket and Server- 
Socket are SSLSocket and SSLServerSocket. Once you get past the 
initial setup of the SSL versions, you can use them in place of 
the standard socket classes. Listing One shows the client appli- 
cation’s (client.java) sendData function, which is responsible for 
interacting with the server once the connection has been made. 
There is nothing specific to SSL in the code— it simply reads 
and writes the socket. Although the applications authenticate 
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and encrypt the stream, all the details are encapsulated within 
the SSL classes. Listing Two shows the corresponding server 
code (server.java). The protocol is straightforward: 


1. Client and server announce who they are (HELLO). 

2. Server says it is ready to receive the credit-card information 
(READY). 

3. Client sends the credit-card information. 

4. Server acknowledges receipt (OK). 

5. Client and server hang up. 


The server has a few messy details relating to listening on the 
socket and handing off connections to new ClientProcessor threads. 
Since this isn’t a tutorial on socket programming, I'll skip past 
that; download the sample code for the complete samples. 


$SLSocket 

The client side of an SSL connection is no easier/harder to set up 
than the server. They both can be pretty straightforward (remem- 
ber, it’s the keys that are the problem). Listing Three shows the 
main function for client.java. The function takes four parameters: 


e Socket type (plain or TLS). 

e Host to connect to. 

e Credit-card number. 

e Amount to charge the credit card. 


The socket type is important since it controls whether you 
use a standard socket or an SSL socket to connect to the serv- 
er. The if statement in line 1 creates an SSL socket if you pro- 
vide a socket type of Transport Layer Security (TLS)— the new 
name of SSL). Lines 2-10 set up an SSZLContext object. An SSZCon- 
text is primarily used to group together the many services it 
takes to authenticate and encrypt communications. 

The first to be created is a key-manager factory. The key- 
manager factory provides objects (KeyManagers) that manage 
particular types of key stores. Just as a keyring keeps your house 
keys organized, a KeyManager keeps the public and private keys 
needed for SSL communications organized. Lines 3—4 get the 
KeyManagerFactory and KeyStore up and running. A KeyStore 
contains the keys while the KeyManager knows how to provide 
access to the particular keys in the key store. The Java Key Store 
(JKS) is a binary file used to store keys. You could create oth- 
er key store types that store keys in different formats; SQL or 
LDAP databases, for example. Depending on how you set up 
the system, you may want to have the server process store its 
keys in some sort of production-level database while using the 
JKS for the clients deployed in the field. 

Lines 5—6 open the JKS using a passphrase. This passphrase 
was used to encrypt the key store when it was first created and 
must be supplied anytime the key store is accessed. Finally, Key- 
ManagerFactory is initialized with the key store. 

Lines 7—8 accomplish the somewhat simpler task of creating a 
TrustManagerFactory. Largely, it is easier because I reuse the same 
key store that was used to initialize the KeyManagerFactory. To 
finish up, the SSZContext finally gets what it needs in the way of 
key and trust managers. 


SSL Protocol 

A key ingredient of SSL is the ability to use public-key cryptog- 
raphy. Each party to the communication has two keys— one pub- 
lic key that can be freely sent to others and one private key it 
keeps to itself. The beauty of these systems is that anything en- 
crypted with the public key can be decrypted with the private 
key, and vice versa. However, data encrypted with the public key 
cannot be decrypted with the public key, which is why the key 
can be sent out freely (the same is also true for the private key). 
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When two programs make an SSL connection, the follow- 
ing occurs: 


1.Each program sends the other the program’s public key. 

2.If configured to do so, and ours is, the server investigates the 
key it was sent to make sure the client is authorized to con- 
nect to the server. 

3. If the client’s key was authenticated, the server creates a ses- 
sion key (a one-time key used to encrypt traffic for this sin- 
gle session). 

4.The server encrypts the session key with the client’s public 
key and sends it to the client. 

5. The client, meanwhile, verifies the server’s session key to make 
sure it is talking with the correct server. 

6. If everything checks out, the client accepts the session key from 
the server and decrypts it with the client’s private key. 


At this point, both the client and server are satisfied that they 
are talking to the authorized and correct entities and have ex- 
changed an encryption key they can use for this session. The 
encryption key is usually for some algorithm like DES, IDEA, or 
Blowfish. These algorithms are faster and use smaller keys than 
those used to implement public- or private-key systems. The se- 
curity isn’t significantly compromised though, because the ses- 
sion keys are only used once. Even if crackers were to break 
the key, they would only have access to that session’s data. They 
could not use the key to decrypt future or past sessions, and 
they could not masquerade as either party because they would 
not have access to either party’s private keys. 


Key Authentication 

Public keys are generally signed (the key is at least signed with it- 
self), and the client and server look at who signed the key to see 
if that person can be trusted. What’s important here is who you 
trust to sign keys. 

There is a file in the JAVA_HOME/jre/lib/security directory 
called “cacerts.” If the key used to sign the remote process’s 
public key is found in cacerts, then JSSE assumes the remote is 
okay and lets the connection proceed. If it isn’t found, JSSE shuts 
the connection, and the remote is left out in the cold. 

This is fine if you are implementing an HTTPS solution, but 
it is unsuitable for most purposes because cacerts is used by all 
JSSE applications and contains keys you didn’t create (such as 
the Verisign key normally used to sign keys to be used by HTTPS 
servers). If you used this default implementation, then any client 
who paid Verisign $400 to sign its key could then gain access 
to your system: No need to crack the client’s key— just pretend 
to be some other client with a valid key and gain access to the 
system. This isn’t good. 


Enter TrustManager 

You can override the default cacerts functionality by creating 
your own TrustManager, because it is the default trust manag- 
er that is using cacerts to authenticate the keys. The cacerts file 
is actually just another key store and, to create a new trust man- 
ager, you create a new TrustManagerFactory and provide it the 
same key store used before (line 9 of Listing Three). Now your 
client and server will consider any key acceptable that has been 
signed by a key in their key store. 


$SLServerSocket 

Listing Four is the server side of tle SSL connection. In lines 1-5, 
you create a server socket and start waiting for clients to connect. 
The interesting part is how you create the server socket factory. 
Lines 1-14 are the same as those in client.java. Remember that you 
are initializing the SSL context that is just a Java clearinghouse for 
all the information it takes to authenticate and encrypt a socket 
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(continued from page 22) 
connection. Instead of getting a socket factory from the context, 
you get a server socket factory and return it to the main method. 
Wrapping up the server, the main method passes the duties 
of creating the server socket and handling new connections from 
clients. Again, it is important to note that the acceptConnections 
method takes as a parameter a regular ServerSocket. Because 
SSLServerSocket is derived from ServerSocket, it can be passed 
to acceptConnections and used like any other ServerSocket. This 
means that it should be fairly easy to convert any plain socket 
application to one that uses SSL. 


Installation 

To use the sample code and develop your own applications, you 
need to install the JSSE package from http://java.sun.com/. Al- 
though you should read the installation instructions that come with 
the package, I'll give you the an express installation method to get 
‘you started. After you unzip the JSSE package, complete the fol- 
lowing steps: 


1. Copy jsse1.0.2/lib/* to JAVA_HOMEjre/lib/ext. 

2. Edit JAVA_HOME/jre/lib/security/java.security. 

3. Add the line security.provider.2=com.sun.net.ssl.internal.ssl 
Provider to the file. 


This gets JSSE working within Java with a minimum of fuss. 
(There are other ways to install JSSE, and this one might not be 
appropriate to your application.) 

For the sample code, merely unzip the archive. You will find 
a jsse_article directory with client and server subdirectories. The 
directories contain all the files you need to run the applications. 
To start up the server, enter $ java server. Then start up the client: 


$ java client plain localhost 5555-444-3333-2222 100.0 
Connected to server. 

Server ready. 

Data sent. 

$ 


The server console then displays: 


$ java server 
Client connected. 
Credit card authorized. 


The server writes the credit-card information to the transac- 
tions.log text file. If you have a packet sniffer handy, you will 
see what is shown in Figure 1. If the packet sniffer is being op- 
erated by someone with nefarious intent, then your customer 
could probably count on a credit-card bill larger than expected 
next month. 
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Figure 2: Ciphertext. 
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To fix that, kill the server and restart it with $ java server TLS. This 
tells the server to require all connections to be made through SSL. 
Start the client, and the displays on the console will be the same: 


$ java client TLS localhost 5555-444-3333-2222 100.0 
Connected to server. 

Server ready. 

Data sent. 


$ 


The first thing you will notice is that the transaction takes far 
longer to complete. There is considerable overhead associated with 
starting up an SSL connection. In your own applications, you can 
save time if you cache the appropriate SSLContext objects between 
connections. You can do this if your client application will remain 
running and occasionally connect/disconnect from the server. 

Now look at Figure 2 to see what the packet sniffer reveals about 
the conversation. The only text you can read is the information 
attached to the keys, and this isn’t much help to any cracker 
wannabes because it reveals nothing without the private keys. 


Key Management 

Recall that the code is not the hard part of using the JSSE pack- 
age. If you want to use JSSE to talk to HTTPS servers, then you 
are set, and the sample programs that come with JSSE will get 
you started. However, if you want to create your own servers 
and clients and authenticate both ends of the connection, then 
you have to know about key management. 

There are many ways to store keys for access by your clients 
and servers. The easiest is to put each server’s public key into the 
key store of each client, and each client’s public key into each 
server’s key store. This means everyone has each other’s key and 
all can communicate with each other. The reason this works is 
that each person’s public key is signed by their private key. When 
the programs validate the keys they were sent, they find that the 
key was signed by a trusted key in their key store. This gets to 
be a real problem if the applications involved can be both client 
and server. Every time you bring up a new client, you potential- 
ly have to go visiting a bunch of different nodes updating keys. 

Instead of putting every key in every store, I suggest you cre- 
ate a master key that you put in all the key stores when you in- 
stall the application. Then you use the master key to sign all the 
client and server keys you create. This way, when the applica- 
tions send their keys, the keys will be signed by a trusted key 
in the key store. This is essentially what Verisign does and why 
HTTPS works between your web browser and servers you’ve 
never contacted before. Verisign has signed the server’s key and 
Verisign’s public key is in your browser’s key store. You don’t 
actually want to have Verisign sign all your keys, because then 
anyone with a Verisign-signed key could interact with your client. 


Certificate Authority 

In a nutshell, what I’m suggesting is that you create your own Cer- 
tificate Authority (CA) to sign your keys. This gets complicated be- 
cause nothing in the Java Development Kit or JSSE lets you set up 
a CA and sign keys. You have to go elsewhere for tools to do this. 
I chose to go with the OpenSSL toolkit (http://www.openssl.org/) 
running on Linux. There are toolsets available from other vendors 
and platforms, however. If you choose to use a different toolset, 
you will just have to substitute the appropriate commands; the 
theory is the same no matter what. 

First, you need to generate your CA’s key. That key is used 
to sign all the other application keys. The OpenSSL toolkit comes 
configured to setup a CA from whatever directory you start it 
in. This means that you need to use all the CA commands from 
the same directory. In the sample code, you'll find the CA di- 
rectory that I used to generate the CA key and sign all the ap- 
plication keys: 
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(continued from page 24) 
1. Generate the CA key 
$ openssl genrsa -rand -des -out ca.key 1024 


2. Create a self signed certificate 
$ openssl req -new -x509 -days 305 -key ca.key -out ca.crt 


You are prompted for location information for the certificate. 
Enter whatever you want, but make sure you enter something 
for each field: 


3. Setup the OpenSSL CA tools 
$ mkdir demoCA 

$ mkdir demoCA/newcerts 

$ touch demoCA/index.txt 

$ cp ca.crt demoCA/ 

$ echo “01” > demoCA/serial 


You now can create the client application’s key store and export 
its public key so your CA can sign it. You can enter whatever you 
want for all the location information, but again make sure you 
enter something— standard alphanumeric characters and spaces, 
but no underscores or other special characters— for every field: 


4. Create a new key store for the client application 
$ keytool -keystore testkeys -genkey - alias client 


When prompted, enter passphrase for the password to use 
this keystore with the sample applications. 


5. Export the client’s public key 
$ keytool -keystore testkeys -certreq -alias client -file client.crs 


6. Sign the client’s key with our CA key 
$ openssl ca -config /etc/openssl.cnf -in client.crs -out client.crs.pem 
-keyfile ca.key 
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At this point, you should have a file called “client.crs.pem,” which 
is the signed public key. It needs to be converted to a format suit- 
able for the JDK’s keytool command, and then imported into the 
testkeys keystore: 


7. Convert to DER format 
$ openssl x509 -in client.crs.pem -out client.crs.der -outform DER 


8. Import CA certificate into client’s key store 
$ keytool -keystore testkeys -alias jsse_article_ca -import -file ca.crt 


9. Import signed key into client’s key store 
$ keytool -keystore testkeys -alias client -import -file client.crs.der 


Step 8 must be completed so that the keytool command agrees 
to import the signed key. While importing the signed key, key- 
tool checks the signatories to ensure that their signatures can be 
validated. They can be validated if their public keys are in the 
key store. 

Once you have completed all of these steps, move the festkeys 
key store to the client directory. Start over with step 4 and create 
a key store for the server process. Just substitute “server” every- 
where you see “client.” Make sure you enter something differ- 
ent in one of the location fields (organizational unit would be 
a good choice). 


Going Into Production 

For each new install of your application, just go through steps 
4—9, You'll likely perform steps 4-5 on the computer where 
the application is installed, then take the product (client.crs) to 
the computer you have designated as your CA and walk through 
steps 6-7. Finally, take the product of those steps, client.crs.der, 
and the CA’s certificate back to the new application. Perform 
steps 8—9 and you will be ready to communicate. 
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(continued from page 206) 


Each time you move a key from one computer to another, 
check its fingerprint to ensure it has not been modified. All 
of the major key commands will display the key fingerprint, 
which can be used to make sure nothing naughty has gone 
on behind your back. This is a necessity if you are not hand 
carrying the keys. If e-mail or some other transport is in use, 
the fingerprint is an effective guard against tampering. In a 
public-key system such as SSL with this key management sce- 
nario, not much could be more disastrous than accidentally 


signing a cracker’s key. 


Conclusion 





watch out for if you implement this system. The computer used 
as the CA is the weak link. If unauthorized access is gained to 
this computer, the CA key could be stolen or, at the very least, 
used to sign bogus keys. Once this happens, crackers would 
have easy access to your systems. They likely wouldn't be able 
to intercept the communications between client and server, but 
would be able to masquerade as a valid client or server. If this 
does happen, you will need to create a new CA key and gen- 
erate new key stores for all of your installs. This is really no dif- 
ferent than what would happen if the keys for any of the com- 
mercial CAs were stolen. If possible, put this computer in a 
secure room, do not connect it to a network, and carry the keys 
in and out on floppy disks. 


The code to convert a regular socket-based application to an 
SSL socket is pretty minimal. However, the issues involved in 
key management are more complex. There are some things to 








e e 
Listing One 
static void sendData( Socket server, String cc, String amnt ) { 
try { 
BufferedWriter out = new BufferedWriter( 
new OutputStreamWriter( server.getOutputStream() ) ); 
BufferedReader in = new BufferedReader ( 
new InputStreamReader( server.getInputStream() ) ); 
boolean success = false; 
String s = in.readLine(); 
if ( s.equals( "HELO CC-SERVER" ) ) { 
System.out.println( "Connected to server." ); 
println( out, "HELO CC-CLIENT" ); 
s = in.readLine(); 
if ( s.equals( "READY" ) ) { 
System.out.println( "Server ready." ); 
println( out, cc + "\t" + amnt ) 
s = in.readLine(); 
if ( s.equals( "OK" ) ) { 
success = true; 
} // af 
} f/ at 
V4 fae 
if ( success ) 
System.out.println( "Data sent." ); 
else 
System.out.println( "Protocal error. -- "+s ); 
server.close(); 
} catch ( Exception e ) { 
System.out.println( "Error: " + e.getMessage() ); 
e.printStackTrace(); 
} // catch 
} // send data 


e e 
Listing Two 
class ClientProcessor implements Runnable { 
Socket socket = null; 
ClientProcessor( Socket client ) { 
socket = client; 
} // constructor 
public void run() { 
try { 
BufferedWriter out = new BufferedWriter ( 
new OutputStreamWriter( socket.getOutputStream() ) ); 
BufferedReader in = new BufferedReader ( 
new InputStreamReader( socket.getInputStream() ) ); 
println( out, "HELO CC-SERVER" ); 
String s = in.readLine(); 
if ( s.equals( "HELO CC-CLIENT" ) ) { 
System.out.println( "Client connected." ); 
println( out, "READY" ); 
s = in.readLine() ; 
writeTransaction( s ); 
System.out.println( "Credit card authorized." ); 
println( out, "OK" ); 
} else { 
println( out, "IHATEYOU" ); 
} // else 
} catch ( Exceptione ) { 
System.out.println("Error: " + e.getMessage()); 
e.printStackTrace(); 
} // catch 
try { 
socket.close(); 
} catch ( Exception ex ) {} 
} // run 
void writeTransaction( String s ) throws IOException { 
FileWriter fw = new FileWriter( "transactions.log", true ); 
PrintWriter out = new PrintWriter( fw, true ); 
out.println( s ); 
out.flush () ; 
out.close(); 
fw.close(); 
} // writeTransaction 


static void println( BufferedWriter bw, String s ) throws IOException { 


bw.write( s ); 
bw.newLine() ; 
bw.flush(); 
} // println 
} // ClientProcessor 


Listing Three 


public static void main( String[] args ) throws Exception { 


String type = args[Q]; 
String host = args[1]; 
String cc = pe Fe 
String amnt = args([3]; 
Socket socket = null; 
1: if ( type.equals( "TLS" ) ) { 
SSLSocketFactory factory = null; 
2 SSLContext ctx = SSLContext.getInstance( "TLS" ); 
3 KeyManagerFactory kmf = KeyManagerFactory.getInstance( "SunX509" ); 
4; KeyStore ks = KeyStore.getInstance( "JKS") ; 
53 char[] passphrase = "passphrase".toCharArray() ; 
6: 
7 
8 


ks.load(new FileInputStream("testkeys"), passphrase) ; 
kmf.init(ks, passphrase) ; 
: TrustManagerFactory tmf = TrustManagerFactory.getInstance("SunX509") ; 
9: tmf.init( ks ); 


10: ctx. init (kmf.getKeyManagers(), tmf.getTrustManagers(), null); 
a13 factory = ctx.getSocketFactory(); 
123 socket = factory.createSocket (host, DEFAULT_PORT) ; 
} else { 
a3: socket = new Socket( host, DEFAULT_PORT ); 
} // else 
14: sendData( socket, cc, amnt ); 
} // main 


e e 

Listing Four 

public static void main(String args[]) { 

String type = "plain"; 

if ( args.length > @ ) { 
type = args[@]; 

} // it 

try { 
ServerSocketFactory ssf = server.createServerSocketFactory( type ); 
ServerSocket ss = ssf.createServerSocket( DEFAULT_PORT ); 
if ( type.equals( "TLS" ) ) 

((SSLServerSocket) ss) .setNeedClientAuth( true ); 

acceptConnections( ss ); 

} catch (IOException e) { 
System.out.println( "Error: " + e.getMessage() ); 
e.printStackTrace() ; 

} // try 

} // main 

private static ServerSocketFactory createServerSocketFactory( String type ) { 
if ( type.equals( "TLS" ) ) { 

SSLServerSocketFactory ssf = null; 
try { 
// set up key manager to do server authentication 


Oe WM 


6 KeyManagerFactory kmf = KeyManagerFactory.getInstance( "SunX509" ); 
73 KeyStore ks = KeyStore.getInstance( "JKS" e 
8: char[] passphrase = "passphrase".toCharArray(); 
9: ks.load(new FileInputStream("testkeys"), passphrase) ; 
19: kmf.init(ks, passphrase) ; 
11: TrustManagerFactory tmf = 
TrustManagerFactory. getInstance("SunX5@9") ; 
123 tmf.init( ks ); 
13% SSLContext ctx = SSLContext.getInstance( "TLS" ); 
14: ctx. init (kmf.getKeyManagers(), tmf.getTrustManagers(), null); 
15: ssf = ctx.getServerSocketFactory() ; 
16: return ssf; 


} catch (Exception e) { 

System.out.println( "Error: " + e.getMessage() ); 
e.printStackTrace(); 

} // try 

} else { 
return ServerSocketFactory.getDefault(); 
i Sf at 
return null; 
} // createServerSocketFactory 
static void acceptConnections( ServerSocket server ) { 
17: Socket socket = null; 
while( true ) { 
// accept a connection 
try { 
18: socket = server.accept(); 

} catch (IOException e) { 
System.out.println("Error: " + e.getMessage()); 
e.printStackTrace() ; 

return; 
} // catch 
// create a new thread to accept the next connection 
ii; ClientProcessor cp = new ClientProcessor( socket ); 
20: (new Thread( cp )).start(); 
} // while 
} // run 


DDJ 


28 Dr. Dobb’s Journal, February 2001 http://www.ddj.com 


Re 









Traffic Internet C 




























- servers, firewalls, e- 
and other network appliance 

















Ss 





BSDi's new TXtreme™ Series line of 1U, 2U, 4U, and 
rackmount servers and RAID systems offers excellent 
price/pertormance and Internet infrastructure-grade | 

reliability, availability and scalability at the industry's Towest 


+1-719-457-8400 
www.bsd.com 


total cost of ownership. Best of all. BSDi 











Why reinvent the wheel? 


[Use components already written and tested by experts] 


“ 
a 
ao 
Ss 
> 
o 
@ 
g= 7 
© ee 5 
we 
- @ 
a 
”“ 
=. 
Pa 
— 
_— 
o 
a 48 
= 
—s 
so 
= 
> 
so 
os 
a 
= 
oso 
2 
— 
oe 
cael 
AS 
7 
2 
— 
2 
a 
sc 
“Ss 
oe 
= 
° 
— 
= 
a 
& 
Ss 
— 
bel 
co} 
es 
oO 
—— 
Pots 
ae 
fmol 
< 
oo 
= 
—_ 





f 








Choose from 5,000 software components 
in over 100 categories 


ven 


© Download products immediately e Download free evaluations 
e Always in stock | © Detailed technical information 
© Personalized backup CDs also available © Online backup for every order 







Apply online for a corporate credit account. 
www.componentsource.com/corpaccount 


For further information, call toll free (888) 850-9911 
























“Our aviation customers rely on the iPortal Suite ee es 
to build some of the world’s largest, most ambitious e-business applications. = 


IONA's iPortal Suite products supply information technology organizations, in virtually every industry, with a platform for developing | 
and deploying enterprise portals and other large-scale e-business applications. : : - — 


























5 3 
oOo 2 
5 2 
OD «a 
= 5 
= = 
oa @ 
= = 
8 6 
2 oe 
ee 
o = 
$s =< 
so 
ee 
Bs 
= = 
— = 
oe 
aoe 
a3 


"Enterprise Portals: The Future of Your e-Business" at 


sts_overview.htm. 
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The JUnit++ 
Testing Tool 





Java integration, 
system, load, and 
Stress testing 





Siegttied Goeschl 


Unit is a freely available Java unit test 

framework created by Erich Gamma 

and Kent Beck. Currently at Version 

3.2, JUnit is extensively used in the 
Java community because of its elegance 
of design and ease of use. Over time, 
JUnit (available at http://members.pingnet 
.ch/gamma/junit.htm) has been extended 
to address issues related to integration, 
system, load, and stress testing. However, 
these extensions have inevitably resulted 
in shortcomings such as the lack of a test 
data repository, which stems from the orig- 
inal design. The absence of a test data 
repository results in hard-coded test data 
that increases the maintenance costs of 
testing database-driven systems. This 
should not be underestimated since the 
test code should have the same quality re- 
quirements as the application code. It is 
common to ship the regression test suite 
as part of the deliverables to increase cus- 
tomer confidence and supplement the doc- 
umentation. To address this and other lim- 
itations, we’ve extended the JUnit test 
framework to provide a test data reposi- 


Siegfried is a technical architect for Soft- 


ware Daten Service. He can be contacted 
at siegfried.goeschl@sds.at. 
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tory, command-line arguments, and an im- 
proved TestRunner class that supports a 
built-in repetition counter and multi- 
threading on the command line. This im- 
proved version, called “JUnit++” (available 


electronically; see “Resource Center,” 
page 5), has been used for more than a 
year to test an online banking applica- 
tion— currently at more than 100,000 
lines of code— at Software Daten Ser- 
vice (Vienna, Austria). 


Using the JUnit Extensions 

The main improvement of JUnit++ is the 
provision of a test data repository accessed 
by subclassing junit.extensions.Config- 
urableTestCase. Listing One illustrates how 
you can use the test data repository for a 
simple test. The class ConfigurableTest- 
Case uses java.util.Properties to load the 
test data. The loading of the property file 
is triggered in the constructor of Config- 
urableTestCase. For a master test suite that 
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only invokes other test suites, this strate- 
gy doesn’t work because no constructor 
is invoked. In this case, loading the prop- 
erty file has to be done manually by in- 
voking initTestProperties() in the suite() 
method; see Listing Two, which shows 
how to use the test data repository for a 
master test suite. 

How is the property file found when 
an instance of ConfigurableTestCase is ex- 
ecuted? It is assumed that the property file 
has the same name as the class file but a 
different extension; that is, the property 
file for the FooTest.class would be named 
FooTest.ini. The property file is either 
looked up in the current directory or in 
the list of directories defined in the class- 
path variable. 

In some projects this approach might 
not work (for instance, using a single 
property file for all test suites); therefore, 
the system property junit.data either de- 
fines a property file or a starting directo- 
ry for the search; see Example 1(a). How 
is an entry in the property file defined? To 
access the test data repository and the 
class name, the name of the test case and 
the property name are concatenated to 
generate the key in the following order: 


1. Fully qualified class name plus test 
name plus property name. 

2. Class name without package name plus 
test name plus property name. 

3. Class name without package name plus 
property name. 

4. Property name. 


This implementation allows the reuse 
of test data definitions shared by one or 
more test suites. For example, Example 
1(b) shows the name of the server used 
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(continued from page 34) 

for testing a client/server application with 
multiple test suites. In the case of a mas- 
ter test suite, the corresponding property 
file contains references to the property 
files of the contained test suites, as in Ex- 
ample 1(c), which are loaded recursively. 

What happens if a different type of 
test data repository already exists (as in 
a relational database)? In JUnit++, the 
implementation of a test data reposito- 
ry is determined and instantiated at run 
time. Hence, it is possible to provide a 
different implementation of a test data 
repository by defining the implementa- 
tion class at the command line, as in Ex- 
ample 2(a). After a successful test exe- 
cution, you might reuse the test suite to 
gather performance data either by sim- 
ply measuring the execution time or pro- 
filing the application. The profiling gives 
a good indication of how much time and 
resources are spent in an ORB run-time 
library or a Servlet engine. It is neces- 
sary to fine tune the number of test ex- 
ecutions to find a balance between the 
effect of one-time initializations distorting 
the result and time required to generate 
the profiling data. By using junit.exten- 
sions. TestRunnerEx, the number of test 
repetitions can be defined on the com- 
mand line in Example 2(b). 

If the profiling of a client/server appli- 
cation is done on one computer, it will 
become overloaded. In this case, a delay 
of the test case invocations can be defined 
on the command line in Example 2(c). 
The next step might be overloading the 
server until it crashes due to race condi- 
tions and/or resource shortage. This can 
be accomplished by invoking one or more 
test suites using multiple threads, repeti- 
tion counters, and test case invocation de- 
lays to simulate a more realistic user be- 
havior; see Example 2(d). 

Some components require command- 
line parameters for initialization. JUnit++ 





provides a uniform way of defining 





Example 1: (a) Specifying a test data repository; (b) content of a test data 


application-specific command-line pa- 
rameters by passing them as a system 
property junit.argv. Some development 
environments are unable to process quot- 
ed arguments when invoking the de- 
bugger. This is why multiple arguments 
can be separated by the pipe character, 
as in Example 2(e). 


Some 
components require 
command-line 
parameters for 
initialization 





Sometimes passing arguments via 
command line is clumsy, for example, 
when initializing a JOBC connection for 
multiple test suites. In this case, a Con- 
figurableTestCase can be used (see List- 
ing Three), where the parameters are 
stored in a corresponding property file. 

If a test suite does a lot of printing, it 
is useful to get a more verbose output 
of a test run. This can be accomplished 
by turning on the verbose mode of 
TestkunnerEx, as in Example 2(f). 


Testing an 

Online Banking Application 

The online banking application is a satel- 
lite system of the Global Entity Ordering 
System (GEOS), a host-based application 
consisting of 6 million lines of code writ- 
ten in C. It provides multiple front ends 
(Swing, HTML, WAP) and is implemented 
in Java utilizing CORBA, servlets, and XML. 
Figure 1 illustrates the application’s ar- 
chitecture. 






repository; (c) content of a test data repository for a master suite. 
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The GUI framework is based on the 
JWAM architecture (see “Framework- 
basierte Anwendungsentwicklung,” by 
Guido Gryczan, Carola Lilienthal, Martin 
Lippert, Stefan Roock, Henning Wolf, and 
Heinz Zullighofen, OBJEKT Spektrum, 
January 1999) and generates requests 
which are delegated to a Datasource. The 
Datasource, in turn, delegates the re- 
quests to the Transport Layer, which 
shields the client from the underlying 
middleware. The requests are transmit- 
ted to the CORBA-based Business Serv- 
er implemented by a CorbaBusinessOb- 
ject, which maps the requests to various 
Data Access Objects. The CorbaBusiness- 
Object and SecurityManager are part of 
an application framework developed in- 
house. The Data Access Object is ulti- 
mately responsible for contacting the 
host-based GEOS, which uses a relational 
database. The results are propagated back 
through the various layers until they are 
displayed by the front end. 

Using a bottom-up test strategy, the ar- 
chitectural layers are tested with the help 
of JUnit++ in reverse order, starting with 
the Data Access Objects, the CORBA ap- 
plication framework, and the Datasources. 

The regression tests for the Data Ac- 
cess Objects are organized by means of 
a hierarchy of test suites. Each test suite 
is derived from ConfigurableTestCase and 
retrieves the test data from a single prop- 
erty file for each database. By simply 
defining a different property file on the 
command line, the test suites can be run 
using a different database. 

The CORBA application framework re- 
gression test consists of 11 individual test 
suites that are invoked through a master 
test suite. The test suites are not only used 
for regular regression testing but also for 
stress testing the ORB and optimizing the 
performance. 


Two Examples 

The implementation of a data conversion 
module copying internal data structures 
to IDL data types was considered too 
slow—10 iterations of the original regres- 
sion test suite took 32 seconds to execute. 
By optimizing the code using a Java pro- 
filer, we were able to reduce the execu- 
tion time significantly: 


e Accessing the metadata of the internal 
data structure was accidentally done 
with a function that didn’t use a meta- 
data cache. By using cached metadata, 
the execution time was reduced to 22 
seconds. 

e Somewhere in the code, a linked list 
was used but the access was done by 
index. Changing the implementation to 
a java.util.ArrayList reduced the execu- 
tion time to 14 seconds. 
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Example 2: (a) Using a different implementation of a test data repository; i b) 





Starting TestRunnerEx with 10 repetitions; (c) executing tests with 10 repetitions 
and 100-ms invocation delay; (da) server crash test with a client executing 1000 
times with 10 threads simultaneously; (e) defining command-line arguments; 


(P) using the verbose mode. 


| JSPS/Servlets 





Figure 1: Application architecture. 


e A handcrafted function for creating a 
timestamp instead of using the Java API 
reduced the execution time to 10 seconds. 

e Some optimizations within the actual 
data transformation code reduced the 
execution time to 8 seconds. 


The regression test for the Datasources 
also acts as an integration and system test 
because this test exercises multiple layers 
of the application— the transport layer, 
middleware, business server, and Data Ac- 
cess Objects. By using JUnit++, this test 
can be easily turned into a stress test ex- 
ecuted over the weekend. 
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In a second series of tests, we used a 
test configuration to simulate the load of 
35,000 clients within nine hours so that all 
server processes and infrastructure com- 
ponents failed. A closer inspection re- 
vealed the following faults: 


e Some tests intentionally didn’t close their 
sessions to test the distributed session 
garbage collection triggered by a ses- 
sion timeout. These sessions never timed 
out due to a coding error. 

e The high load resulted in thrashing of 
the server processes and socket time- 
outs. The default behavior of the ORB 
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in case of a socket error is a fail-over 
to a replica server, which increased the 
load and the number of allocated 
threads further. 

e The high number of sessions, con- 
nections, and threads lead to a short- 
age of virtual memory, which crashed 
the CORBA Naming Service. 


Having multiple regression tests, it is rea- 
sonable to use the test suites in order to 
create a Daily Build and Smoke Test (see 
Rapid Development, by Steve McConnell, 


Microsoft Press 1996). This guarantees a cer- 
tain level of quality because the regression 
tests cover a large portion of the function- 
ality and span multiple layers of software. 


Conclusion 

JUnit++ makes it possible to separate test 
data from test code, which is essential for 
testing database- driven applications. Omit- 
ting this separation results in hard-coded 
test data, which increases the tests main- 
tenance costs and decreases the accep- 
tance of regression tests. 





The ease of generating an arbitrary 
load under various conditions encour- 
ages developers to use stress tests for 
components early in development, which 
in turn, increases the reliability of the fi- 
nal product. This observation is particu- 
larly relevant for distributed systems be- 
cause it might require an expert’s 
knowledge to trace a failure back to a 
single component. 
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Listing One 


public class FooTest extends ConfigurableTestCase { 
public FooTest ( String name ) { 
super (name) ; 


public void testFoo() { 
String stringValue 
Boolean booleanValue 
Integer integerValue 
Double doubleValue 


getString( "key1" ); 
getBoolean(-"key2" ); 
getInteger( "key3" ); 
getDouble( "key4" ); 


hou ou a 


} 


return suite; 
} 
} 


Listing Three 


public class JDBCTestSetup extends ConfigurableTestSetup { 
static private Connection connection = null; 
public JDBCTestSetup () {} 


public void setUp() { 


connection = DriverManager.getConnection ( 
getString( "url" ), 
getString( "user" ), 
getString( "password" ); 


Listing Two 


public class AllTests extends ConfigurableTestCase { 
public AllTests ( String name ) { 
super (name) ; 


public void tearDown() { 


) 
connection.close(); 


public Connection getConnection() { 
return connection; 
public static Test suite() { 3 
/ load property file } 
initTestProperties( AllTests.class ); } 


TestSuite suite= new TestSuite(); 
suite.addTest( FooTest.class ); 
suite.addTest( BarTest.class ); 
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The problem: Compiler runtime libraries allow only one thread at a time to be active in the heap. So on SMP systems, when 
multiple threads make concurrent heap requests, all but one will be blocked by the heap manager, nullifying the benefit of the 
extra CPUs. Worse yet, each time a thread is blocked, the OS invokes a context switch. The result: adding processors results in 
a vicious cycle of context switching that can prevent your app from scaling. 
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Import and export 60+ raster file formats - Supports a wide range 
of compression options, bit depths (up to 64) and color spaces. 
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non-image data. 


Image processing - Transforms- resize, rotate, linear and bicubic 
interpolation, flip, invert, reverse, crop, underlay, shear, transpose, 
auto deskew and combine. Filters - sharpen, intensity, saturation, 
histogram, posterize, median, edge, noise and more. Spacial filters - 
gradient, laplacian, sobel, prewitt and more. 


Display - scroll, scale with interpolation, dither, contrast, brightness, 
with a choice of over 2,000 special effects. 


Also includes scanning (TWAIN), capture, printing, screen capture, 
imaging common dialogs, thumbnail browser, image list, Internet and 
database imaging functions and much more! 
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DICOM 3.0 - Supports the latest specs, including all standard |OD 
classes and modalities (CR, CT, MR, NM, US, RF, SC, VL, Worklist, 
etc.) and DICOM directory. 


DICOM communications protocol - COMPLETE support including 
all Service Classes (Verification, Storage, Query/Retrieve, Patient 
Management, etc). High-level communications functions to simplify 
the creation of DICOM client/server applications. 


Optimized image processing - the richest in the industry supporting 
1,2,3,4,5,6,7,8,12,16,24 and 32 bit images. Includes 8-16 bit 
grayscale display with window leveling and LUT support. 


Also includes Medical specific annotations like cross product, point 
and protractor, allowing the measuring and mark-up of medical files. 
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Support for client server development over the Internet or LAN - 
Provides a framework for sending /receiving commands (LOAD, 
SAVE, CREATEWIN, etc.) from one computer to another. 


Small ActiveX controls and COM objects - Provides LEADTOOLS 
raster imaging functionality in Internet applications. 


rsions of LEADTOOLS for all major digital imaging categories including still frame color 
nae peck document imaging, medical imaging, Internet imaging and vector imaging. 
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Powerful annotation capabilities - enable the addition of text, 
highlights, sticky notes, audio, ellipses, buttons, lines, arrows, 
rectangles, polygons, redaction, hotspots, freehand scribble, 
pointers, bitmaps, stamps, rulers and hyperlinks all with user-defined 
security features. 

Document Image processing and Clean-up- Despeckle, deskew, 
inverted text, removal of dots, blobs, lines, borders, hole punches, 
character smooth. Region of interest, preview of changes and 
composite viewing of the modified regions. 

Optimized viewing of bitonal images - Specialized display filters 
including FavorBlack and ScaleToGray, bilinear and bicubic 
interpolation. 

Also includes ultra-fast CCITT G-3, G-4 and rotation, JBIG 
compression and a pan-window. 
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import/export - DWG, DXF, EMF, WMF, CGM, DGN, DRW, HPGL, 
HPGL2, PICT and LEAD VEC (LEAD's proprietary file format) in 
native form. 


‘Editing - 14 different primitive object types (Vertex, line, rectangle, 


polyline, polybezier curve, polygon, ellipse, circle, arc, text, pie, 
chord, polydraw, and raster). Group objects into layers, copy or move 
objects between layers, lock individual layers. Add, edit, delete, 
rotate, translate and scale objects and layers. Convert points from 
world space to screen space and vice versa. Pixel accurate hit 
testing. 


Also includes 3-D viewing (lighting, shadow, camera), 3 vector 
engines (GDI, OpenGL and DirectX), convert or overlay vector 
drawings to any LEAD supported raster format. 
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MPEG-1, MPEG-2 files. 


Capture - capture multimedia data from any Video for Windows or 
DirectShow capture device. 


PDF Plug-in 

Adds support for PDF, Postscript, and EPS formats. The LEADTOOLS 
PDF Plug-in allows for the loading, saving, viewing, rasterizing, and 
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LEADTOOLS Internet Imaging 
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another. With LEADTOOLS Internet Imaging developers can easily write 
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HTTP and FTP servers. 


LEADTOOLS Forms Recognition 


Provides full APIs with powerful forms recognition functions to aid in the 
creation of recognition applications. Gives developers the ability to 
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Storage capabilities for 
modern systems 





Chad Gibbons 


p to now, using a tape device in a 
Java environment has been almost 
impossible. No native support for 
tape devices exists in Java, and open- 
source libraries don’t seem to be available. 
Luckily, tape device access hasn’t been a 
pressing need for most Java developers. 
As Java expands its influence across en- 
terprises, however, tape device access will 
become a more common requirement. 
Providing a library for tape device ac- 
cess can seem daunting. The Java Native 
Interface (INI) must be used to call the 
local operating system’s tape device API. 
Each operating system treats tape devices 
differently, complicating the effort of de- 
veloping a library that acts consistently 
across all Java implementations. Tape de- 
vices themselves add to the complexity. 
Tape devices can support either fixed- 
or variable-length blocks and let an appli- 
cation change the block size before writ- 
ing. Most modern devices also allow a com- 
bination of fixed- and variable-length blocks. 
The commonly available QIC-format de- 
vices, however, allow only fixed-length 
blocks of 512 bytes. The tape library I'll pre- 
sent in this article allows an application to 
manage the block size, but the application 
must do the right thing based on its needs. 
Usually, the various on-tape formats, such 
as the Microsoft Tape Format (MTF) or the 
System Independent Data Format (SIDF), 
handle the block size issue. 
A single device usually supports dif- 
ferent media types. In the case of a QIC- 
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format device, only one type of media 
can be written, while many types of 
media can be read. In a more advanced 
device, such as a DAT or DLT, several me- 
dia types can be written transparently. 
Regardless of how devices handle dif- 
ferent media types, almost all support dif- 
ferent densities on a single media type. 
Density support varies greatly across op- 
erating systems and is often poorly im- 
plemented. Only compression can be 
modified on Windows NT. This limitation 
can make writing media that need to be 
read on another platform more difficult. 


On UNIX systems, different densities are 
usually available only by specifying a dif- 
ferent path name when opening the tape 
device. Luckily, for most purposes, using 
the default density is the best choice. 

Error handling is one of the more 
challenging issues in dealing with tape 
devices. Most UNIX platforms provide 
two different types of error handling: 
persistent and nonpersistent. Linux pro- 
vides only nonpersistent error handling. 
Nonpersistent error handling is often re- 
ferred to as “BSD behavior,” while per- 
sistent error handling is referred to as 
“System V behavior.” Windows NT sys- 
tems provide only nonpersistent error 
handling. 
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Persistent error handling forces an ap- 
plication to clear error conditions, such as 
end-of-file, before further operations can 
continue. Nonpersistent error handling re- 
turns an error once, and then continues 
processing with the next I/O operation. 
Typically, this only matters in how the 
end-of-file condition is handled while 
reading. A nonpersistent error handling 
environment automatically skips the end- 
of-file mark whenever it is reached. Per- 
sistent error-handling environments will 
only skip the end-of-file mark after the 
error has been cleared or when explicitly 
told to space over it. 

In this discussion, I focus on the basic 
operations of tape devices— reading, writ- 
ing, rewinding, and skipping to the end of 
data. Additional support operations, such 
as manipulating the block size and allow- 
ing writes past the logical end-of-media, 
are provided to allow realistic use. All of 
the code has been developed with Java 2 
compilers and run-time environments, but 
also works in Java 1 environments. 


Designing the Interface 
The java.net.Socket class provides an ex- 
cellent example of creating an interface to 
a device that requires complex control 
mechanisms and simple I/O streams at the 
same time. I'll follow this lead for the Ba- 
sicTapeDevice class, which provides all con- 
trol mechanisms and two methods to re- 
trieve an InputStream and an OutputStream 
for basic stream I/O. Persistent error han- 
dling is used in the interface regardless of 
the environment’s choice for error handling. 
The interface for the BasicTapeDevice 
contains the ability to construct, cleanup, 
configure, and perform stream I/O to the 
device. Example 1 shows the interface def- 
inition of this class. Construction of a Ba- 
sicTapeDevice requires a single String ar- 
gument specifying the path name of the 
tape device. I assume that the application 
or the user knows the correct device path 
name to use. The close method allows the 
application to cleanup and release the tape 
device resource on demand, rather than 
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(continued from page 44) 
waiting for the Java garbage collector to 
do it using the finalize method. 

The getinputStream and getOutputStream 
methods return an InputStream and Out- 
putStream object, respectively. Since an ap- 
plication might retrieve I/O streams sever- 
al times during its lifetime, both of the 
stream objects are kept internally in the Ba- 
sicTapeDevice object. By keeping internal 
references to these objects, the garbage col- 
lector cannot clean these objects and cause 
an unexpected call to the close method. 

The simplest tape movement commands 
are also provided in the interface. The 
abilities to rewind and to skip to the end 
of recorded media are critical for the ba- 
sic operation of tape applications. I’ve left 
out a discussion of the more advanced 
movement commands, such as skipping 
between files or individual blocks on tape, 
because of their complexity. 


The Implementation 

Listing One shows the implementation of 
BasicTapeDevice. The object contains sev- 
eral properties for maintaining state, two in- 
ner classes to provide I/O streaming, and 
the declarations needed for native interfaces. 


When the Java class loader loads the 
BasicTapeDevice class, a static block in 
the class is executed. The static block de- 
termines the appropriate JNI library to load 
based on the current operating-system 
name. The library name is in the form of 
Tape<osname>, such as TapeLinux, Tape- 
Solaris, or TapeWinNT. Once the library 
has been loaded, the native method init- 
Fields is called. The initFields method de- 
termines the field identifiers of all of the 
member variables of the BasicTapeDevice 
object that the native library manipulates. 
The private fd member variable of the 
FileDescriptor object used is also extract- 
ed. Luckily, JNI methods can access pri- 
vate methods of objects if they explicitly 
specify the full class name of the object. 
Example 2 shows the initFields method 
that can be used by any native library. 

The inner class TapeInputStream pro- 
vides an object that calls the appropriate 
native method for reading on behalf of 
the device object. Because the class is an 
inner class, it has full access to the private 
member variables of BasicTapeDevice. The 
implementation of TapelnputStream is sim- 
plistic— it fulfills the contract specified by 
InputStream and calls the native method 





Example 1: BasicTapeDevice interface. 
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2: Initialization of the native library. 
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tapeRead when necessary to perform in- 
put. The tapeRead method must set the 
eof member variable whenever end-of-file 
is encountered. Once end-of-file is en- 
countered, the stream will always return 
—1 for read operations until the end-of- 
file indicator has been cleared by the ap- 
plication by using the clearEOF method 
in BasicTapeDevice. Two consecutive end- 
of-file indicators signal that the end of 
recorded data has been reached. 

Like TapelnputStream, TapeOutput- 
Stream provides writing functionality to 
the parent device object. TapeOutput- 
Stream delegates writes to the native 
method tapeWrite. All writes must be a 
multiple of the block size specified by the 
getBlockSize method of BasicTapeDevice. 
If the block size is zero, then writes of 
variable length are allowed. Some oper- 
ating systems limit the maximum variable- 
length block to 64 KB, so you should de- 
termine if this will affect the tape format 
for your application. If a partial write to 
the device is encountered, TapeOutput- 
Stream ensures that all data is written to 
the tape media so the application never 
has to worry about it. 

One caveat of writing to tape devices 
is the separation between logical and phys- 
ical end-of-media. Most modern tape de- 
vices have a logical end-of-media indica- 
tor well before the physical end of tape. 
When a write detects this indicator, it in- 
forms the application to flush its buffers 
and write any end-of-media records re- 
quired by the tape format. When logical 
end-of-media is found by the TapeOut- 
putStream class, it will throw a Logical- 
EOMException to indicate this (see Listing 
Two). The application must then call the 
clearEOM method of BasicTapeDevice to 
allow further writes. Once physical end- 
of-media is reached, all writes will fail with 
an [OException. 

Other than a few support methods, the 
remainder of the BasicTapeDevice class is 
the declaration of the following native 
methods: initFields, tapeOpen, tapeClose, 
tapeRead, tapeWrite, tapeGetBlockSize, 
tapeSetBlockSize, tapeRewind, and tape- 
SpaceEOD. All of these methods must be 
implemented in the native library for each 
platform supported. 


Implementing the 

Linux Native Interface 

The file TapeLinux.c (available electroni- 
cally; see “Resource Center,” page 5) is the 
native interface library for Linux. The im- 
plementation of this library is straightfor- 
ward. The tapeOpen, tapeClose, tapekead, 
and tape Write functions are thin wrappers 
around the Linux system calls. The tape- 
Read and tapeWrite methods set the eof 
and eom member variables, respectively, 
whenever they encounter end-of-file or 
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(continued from page 40) 

logical end-of-media. Since Linux current- 
ly implements only nonpersistent error han- 
dling, I assume that file marks are skipped 
whenever end-of-file is encountered. Be- 
sides the common initFields method, the 
library also contains three utility functions: 
getFD, setFD, and throw. All of these meth- 
ods could be ported to any of the UNIX 
variants. 

The tapeGetBlockSize, tapeSetBlockSize, 
tapeRewind, and tapeSpaceEOD methods 
all use the Linux mtio API. In these meth- 
ods, either the MTIOCGET or MTIOCTOP 
ioctl functions are used to call the ap- 
propriate behavior in the tape API. The 
mtio API is similar across all UNIX vari- 
ants, yet can differ enough to make port- 
ing a chore. 


Implementing the 

Windows NT Native Interface 
TapeWinNT.c (also available electronical- 
ly) shows the native interface library for 
Windows NT/2000. Windows 9x environ- 
ments do not use the same tape API, and 
can’t make use of the library. 

The native interface library for NT is only 
slightly more complex than the Linux li- 
brary. Although the function names and 
structures are all different, the Win32 tape 
API is remarkably similar to traditional UNIX 
implementations. Integer file descriptors are 
replaced with HANDLE variables, but these 
are still 32-bit quantities easily stored in the 
java.io.FileDescriptor class. 

Windows NT has one annoyance that 
is not present on UNIX implementations. 
If a media change is detected at any time, 
the next tape API call will fail with the er- 
ror code ERROR _MEDIA_CHANGED. 
While knowing that the media has been 
changed is useful, the Win32 API saves 
this information between applications. If 
an application is run for the first time and 
it just opened the tape device, it is possi- 
ble to receive this error on the first tape 
operation performed. To solve this issue, 
I call the Win32 API function GetTape- 
Status while in the native tapeOpen 
method. It is still possible to receive the 
ERROR_MEDIA_CHANGED status while 
performing other tape operations, but I 
consider that to be a genuine error that 
should be thrown as an exception to the 
application. 


Performance 

You can create efficient I/O applications 
in Java. However, unlike disk or network 
I/O, tape I/O is especially sensitive to a 
buffer underrun condition. If a streaming 
tape device is starved of data then the 
tape drive will stall. A stalled tape device 
must stop the write head, rewind slight- 
ly, and then continue forward until a safe 
write position is found once again— the 
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infamous “shoe-shining” effect. Some op- 
erating systems and tape devices allow 
automatic padding of the buffer to pre- 
vent starvation. Adding support of this 
feature to your tape device code might 
be warranted in some applications. 

Buffering is always critical for I/O ap- 
plications, and especially for ones that 
write to tape devices. Without an adequate 
buffering mechanism in place, it’s very 
easy to stall a tape device and ruin your 
performance. In my testing, I’ve found 
that a 1-MB buffer size provides the best 
throughput with the smallest memory us- 
age. You may find that your application 
and hardware situation can live with as 
little as 64 KB of buffer, or require as much 
as 128 MB or more. 

It is more complex to implement 
buffered writes to a tape device compared 
to other devices. The java.io.Buffered- 
OutputStream class is useful, but does not 
guarantee any granularity on the actual 
output buffers written to the final output 
stream. If your tape device has a fixed 
buffer size, then you will immediately 
cause an error the first time BufferedOut- 
putStream attempts to write a block that 
is not a multiple of that block size. To 
solve this issue, I’ve created a Fixed- 
BufferedOutputStream class (also avail- 
able electronically). This class provides 
the same functionality as Java’s Buffered- 
OutputStream, but ensures that all writes 
to the tape device are a multiple of the 
specified block size. 

Most applications do more with data 
than write it to tape. Filter streams are 
commonly used in Java applications to 
calculate data checksums. These addi- 
tional filter streams can easily consume 
your CPU time if your Java run-time en- 
vironment does not have a Just-In-Time 
(GIT) compiler installed. Without a JIT, a 
simple output stream chained to a Di- 
gestOutputStream can consume seven 
times the CPU time as an environment 
with a JIT. Figure 1 presents some com- 
parisons of writing a 10-MB file to tape 
using a variety of JREs to demonstrate the 
importance of a good Java run-time en- 
vironment. 


Providing Full-Featured 

Tape Device Support 

The BasicTapeDevice class presented here 
is only the beginning of a robust tape sup- 
port library. Most applications that do 
more than dump data to tape will need 
the ability to space between the various 
files on tape. More advanced applications 
might require spacing between the records 
in a file and absolute logical and physical 
positioning. Controlling the various at- 
tributes of a tape device and its media, 
such as compression and density, are abil- 
ities most applications would find useful. 
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Figure 1: Writing a 10-MB file to tape, Linux 2.2.16, 400-MHz Pentium I. 


Extending the BasicTapeDevice class de- 
scribed here is a straightforward task and 
can be customized based on your appli- 
cation’s needs. The implementation de- 
scribed in this article, along with a more 
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advanced library, is also available at 
http://jtape.sourceforge.net/. 
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Listing One 


/* BasicTapeDevice. java */ 

import java.io.*; 

public class BasicTapeDevice { 
private FileDescriptor fd; 
private InputStream in; 
private OutputStream out; 
private boolean eof; 
private boolean eon; 
private boolean ignoreEOM; 


public BasicTapeDevice(String pathName) throws I0Exception { 
fd = new FileDescriptor(); 
tapeOpen (pathName) ; 
in = new TapeInputStream() ; 
out = new TapeOutputStream() ; 
eof = false; 
eom = false; 
ignoreEOM = false; 
} 
public synchronized void close() throws IOException { 
if (fd != null) { 


try { 
if (fd.valid()) { 
tapeClose(); 
} 
} finally { 
fd = null; 
} 
} 
} 
public InputStream getInputStream() throws IOException { 
ensureOpen() ; 
return in; 
} 


public OutputStream getOutputStream() throws I0Exception { 
ensureOpen() ; 
return out; 


} 

public int getBlockSize() throws IOException { 
ensureOpen() ; 
return tapeGetBlockSize(); 

} 

public void setBlockSize(int bs) throws IOException { 
ensureOpen() ; 
tapeSetBlockSize(bs) ; 


public void rewind() throws IOException { 
ensureOpen(); 
tapeRewind() ; 
} 
public void spaceEOD() throws IOException { 
ensureOpen() ; 
tapeSpaceE0OD() ; 
} 
public void clearEOF() throws IOException { 
ensureOpen() ; 
if (eof) { 
eof = false; 
/* assume that the file mark has already been skipped */ 
} else { 
throw new I0Exception("not at end of file"); 
} 
} 
public void clearEOM() throws IOException { 
ensureQpen(); 


if (eom) { 
ignoreEOM = true; 
} else { 
throw new I0Exception("not at logical end of media"); 
} 
} 
class TapeInputStream extends InputStream { 
private byte[] temp = new byte[1]; 
public int read() throws IOException { 
int n = read(temp, @, 1); 
if (n <= 6) { 
return -1; 
} 
return temp[@] & Oxff; 
} 
public int read(byte[] b, int off, int len) throws IOException { 
if (b == null) { 
throw new NullPointerException() ; 
} 
if (off < @ |; len < @ jj offtlen > b.length) { 
throw new IndexOutOfBoundsException() ; 


} 
if (len == 9) { 
return 9; 
J 
if (eof) { 
return -1; 
} 
ensureOpen() ; 
int n = tapeRead(b, off, len); 
if (n <= @) { 
return -1; 
} 
return n; 
} 
public long skip(long numbytes) throws IOException { 
return 9; 
} 


public void close() throws IOException { 
BasicTapeDevice.this.close(); 
} 
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class TapeOutputStream extends OutputStream { 
private byte[] temp = new byte[1]; 


public void write(int b) throws IOException { 
temp[@] = (byte) b; 
write(temp, 9, 1); 
} 
public void write(byte[] b) throws IOException { 
write(b, @, b.length) ; 
y 
public void write(byte[] b, int off, int len) throws I0Exception { 
if (b == null) { 
throw new NullPointerException() ; 


} 
if (off < @ |; len < @ |; offtlen > b.length) { 
throw new IndexOutOfBoundsException() ; 
} 
if (eom && !ignoreEOM) { 
throw new LogicalEOMException("logical end-of-media") ; 
} 
int n = tapeWrite(b, off, len); 
while (n < len) { 
n += tapeWrite(b, off + n, len - n); 
} 
} 
public void close() throws IOException { 
BasicTapeDevice.this.close(); 


} 
} 
protected void finalize() { 
try { 
close(); 
} catch (IOException ex) { 
} 
} 


private void ensureOpen() throws IOException { 

if (fd == null |; !fd.valid()) { 

throw new I0Exception("tape device is not open"); 

} 
} 
private static native void initFields(); 
private native void tapeOpen(String pathName) throws IOException; 
private native void tapeClose() throws IOException; 
private native int tapeRead(byte[] b, int off, int len) throws IOException; 
private native int tapeWrite(byte[] b, int off, int len) throws IOException; 
private native int tapeGetBlockSize() throws IOException; 
private native void tapeSetBlockSize(int bs) throws IOException; 
private native void tapeRewind() throws IOException; 
private native void tapeSpaceEOD() throws IOException; 


/* load the JNI library specific for this platform */ 
static { 
StringBuffer buf = new StringBuffer("Tape") ; 
String osName = System.getProperty("os.name") ; 
if (osName.equals("Windows NT") |; osName.equals("Windows 2000")) { 
buf. append ("WinNT") ; 
} else { 
buf. append (osName) ; 


System. loadLibrary (buf.toString()); 
initFields() ; 


Listing Two 


/* LogicalEOMException.java */ 
import java.io. IOException; 
public class LogicalEOMException extends I0Exception { 
public LogicalEOMException() { 
super (); 


public LogicalEOMException(String s) { 
super(s); 
} 
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he Java Addition to the Default Envi- 
ronment (J.A.D.E. for short) is an 
open-source library (available at 
http://jade.dautelle.com/) that fills 
gaps in the JDK core library. Among the 
extensions J.A.D.E. 1.0 includes are XML 
support for all Java classes, generic ma- 
trix classes, automatic error calculation on 
all operations (including numeric errors), 
more than 400 measurement units for al- 
most 40 different quantities, automatic unit 
simplification and verification, and sup- 
port for enumerated types. 
The reasons I developed the J.A.D.E. 
package include: 


e To eliminate more interface errors. If a 
method expects a Length, it gets a 
Length— not some float type that is sup- 
posed to be in inches. (This is the kind 
of problem that led to the 1999 Mars 
Climate Orbiter fiasco.) 

e Quantities for which precision is un- 
known are not very useful (especially if 
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they are the solutions of a possibly sin- 
gular system of linear equations). 

e To eliminate conversion errors. The U.K. 
gallon is different from the U.S. gallon, 
which is also different for liquid or dry 
products. 





e To introduce a Matrix class generic 
enough to resolve systems of linear 
equations involving any element: Real, 
Complex, Big Numbers, Quantities, and 
the like. 


In this article, T’ll focus on one of 
J.A.D.E.’s most useful components —XML 
support. 


XML and Java 
In Java and XML (O'Reilly & Associates, 
2000), Brett McLaughlin wrote: “Java rev- 
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Default Environment 


olutionized the programming world by 
providing a platform-independent pro- 
gramming language. XML takes the rev- 
olution a step further with a platform- 
independent language for interchanging 
data.” The question programmers are 
faced with is how to put the two to- 
gether. 

There are already many open-source li- 
braries available to parse XML documents. 
For example, using JDOM (http://www 
jdom.org/) you can represent XML doc- 
uments as Java objects. Plus, Sun is cur- 
rently leading a project that is code-named 
“Adelard” to generate Java classes from 
XML Schema. 

Unfortunately, the starting point for all 
of these tools is the XML document. More 
often than not, it is the other way 
around— you have Java classes and you 
want to provide persistency to your ex- 
isting classes using XML. Here’s how 
J.A.D.E. addresses this particular issue. 


Creating Objects from XML 

Say you wrote a package to represent two- 
dimensional areas. Your package possibly 
includes classes such as Point, Ellipse, Poly- 
gon, and Area. As an enhancement, you 
decide to add the capability to load an 
area from an XML file. 

Using J.A.D.E., all you need to do is 
to provide an XML constructor for all 
XML elements involved in the construc- 
tion process (in this case the geometri- 
cal objects). An XML constructor is noth- 
ing more than a Java constructor with 
two parameters— one for the attributes 
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(continued from page 52) 

and the other for the child elements. 
When the XML document is parsed, your 
constructor is called with the parameters 
set according to the element’s attributes 
and content. The attributes are simple 
properties of the object being created (rep- 
resented by a String), whereas the con- 
tent is the list of all nested elements that 
have been created recursively. 


The document 


handler uses the 
Reflection API 





Once you have written the XML con- 
structors (see Listing One), this code lets 
you create an area from a file. 


Constructor constructor = 
new Constructor("org.apache.xerces.par- 
sers.SAXParser"); 
Area area = (Area) constructor.create(file); 


As you can see, you need a SAX 2.0 
parser at run time. The binary distribu- 
tion of J.A.D.E. includes a subset of the 
Xerces 1.2.0 parser developed by the 
Apache Software Foundation, but any 
SAX 2.0 parser will do (for a list of SAX 
2.0 parsers go to http://www.megginson 
.com/SAX/). SAX 1.0 parsers will not 
work because J.A.D.E. supports name- 
spaces and SAX 1.0 does not. Name- 
spaces map directly to Java packages. 
Listing Two shows the XML representa- 
tion of an area without a namespace and 
Listing Three shows the same area rep- 
resented using a default namespace; both 
can be used interchangeably. Also, you 
may notice that because the class Area 
may contain instances of itself, the XML 
representation of an area may include 
nested area elements. 


How Does it Work? 
Under the hood, the document handler 
uses the Reflection API to dynamically cre- 
ate instances of the elements being parsed. 
How this API works is beyond the scope 
of this article (for more information read 
the Reflection API Tutorial at http://web2 
java.sun.com/docs/books/tutorial/reflect/ 
index.html). 

It is interesting to note that the con- 
structive approach used by J.A.D.E. works 
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even on immutable objects. Another ap- 
proach using interface (factory method 
pattern) has been studied but rejected be- 
cause of its lack of support for these im- 
mutable objects. 


Saving Objects in XML 

In addition to converting XML documents 
to Java objects, you can also do the re- 
verse. Using J.A.D.E., the instances of a 
class can be saved in XML format if the 
class implements the interface com.dautelle 
xml.Representable. Basically, the class has 
to provide two methods: getAttributes() 
and getContent( ). 

It is up to you to decide what is go- 
ing to be an XML attribute and what is 
going to be a child element. Don’t for- 
get, however, that if the XML represen- 
tation of your class is consistent with its 
XML constructor, then you will be able 
to save and retrieve any of your Java ob- 
jects. Listing Four shows the implemen- 


Listing One 


package com.dautelle.geom2d; 
public class Point { 

double x; 

double y; 

// XML constructor. 


public Point(Attributes attributes, Elements content) { 
x 


= attributes. getDouble("x") ; 
y = attributes.getDouble("y") ; 
} 


public abstract class Surface {} 


public class Ellipse extends Surface { 
Point center; 
double width; 
double height; 
// XML constructor. 


public Ellipse(Attributes attributes, Elements content) { 


center = (Point) content.get(@); 
width = attributes. getDouble("width") ; 
height = attributes. getDouble("height") ; 


} 

public class Polygon extends Surface { 
Point [] vertices; 
// XML constructor. 


public Polygon(Attributes attributes, Elements content) { 


vertices = new Point[content.size()]; 
content.toArray (vertices) ; 


} 

public class Area extends Surface { 
Surface [] surfaces; 
// XML constructor. 


public Area(Attributes attributes, Elements content) { 


surfaces = new Surface[content.size()]; 
content.toArray (surfaces); 


Listing Two 


<?xml version='1.9'?> 
<com.dautelle.geom2d.Area> 
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tation of com.dautelle.xml.Representable 
for geometrical objects. 

Writing a representable object is a 
two-step process. First, you will need 
to create an ObjectWriter. This can be 
done via: 


ObjectWriter ow = new ObjectWriterQ); 


Or if you want to use namespaces, you 
may pass the mapping between the pack- 
age names and the namespace prefix as 
argument: 


Properties namespaces = new Properties(); 
namespaces.setProperty(''com.dautelle.geom- 


2d", "geom2d"); 
ObjectWriter ow = new ObjectWriter(name- 
spaces); 


The second step is to perform the ac- 
tual writing. The XML encoding to be used 
depends on the output type: 


e UIF-8 encoding (1 byte per character) 
for java.io.OutputStream: 


TATA TOTAAL OOOO) 
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FileOutputStream out = new FileOutput- 
Stream(file); 
ow.write(area, out); // UTF-8 encoding. 


e UTF-16 (2 bytes per character) for 
java.io. Writer: 


StringWriter sw = new StringWriterQ; 
ow.write(area, sw); // UTF-16 encoding. 


Conclusion 

While Java has an excellent API, you can- 
not always count upon Sun to offer ev- 
erything you need (or you may wait a 
very long time). The J.A.D.E. library aims 
to complement the core JDK. It is open 
source and will hopefully grow with the 
contribution of all of you who want to 
continue to make Java a better platform. 


DDJ 


ELBE LEI DIE EEDIE DELL ELLE ELIS DELE LILES ASOD EPIL ISCOLELEE DS EEE SEN LIIESESE NLL SI LISS S EIS LOEDLSI ESOL LILLE NEL SESE DV ESE LEAVES GATES TIED I ERDAS LANES SANE VV DLE ODED LOADED LEN INI AS EAA TNT AES EOE ASRS EN ASSAD USA A aR REN AAS AURA ER AAU a A aR A UR UR A AACR a Ne ER AUN 





<Point x= "0.0" y="0.0"/> 


</Ellipse> 
<Polygon> 


<Point. x= "=1,0" y="=1.9"/> 
<Point x= "9.0" y="1.0"/> 
<Point x= "1.0" y="-1.0"/> 


</Polygon> 
<Area> 


<Ellipse width="1.0" height="20.0"> 


<Point x= "0.0" y="0.0"/> 


</Ellipse> 
</Area> 
</Area> 


Listing Four 


package com.dautelle.geom2d; 
import com.dautelle.xml.*; 
public class Point extends Representable { 


double x; 
double y; 


public Attributes getAttributes() { 
Attributes attributes = new Attributes(); 


attributes.add("x", x); 
attributes.add("y", y); 


return attributes; 


} 
return null; 


} 


public Representable[] getContent() { 


public abstract class Surface extends Representable {} 


public class Ellipse extends Surface { 


Point center; 
double width; 
double height; 


public Attributes getAttributes() { 
Attributes attributes = new Attributes(); 


attributes.add("width", 


width) ; 


attributes.add("height", height) ; 


return attributes; 


public Representable[] getContent() { 
return new Representable[] { center }; 


} 


<com.dautelle.geom2d.Ellipse width="1.0" height="20.0"> } 
public class Polygon extends Surface { 


<com.dautelle.geom2d.Point x= "0.0" y="0.0"/> 
</com.dautelle.geom2d.Ellipse> 
<com.dautelle. geom2d.Polygon> 
<com.dautelle.geom2d.Point x= "-1.0" y="-1.6"/> 
<com.dautelle.geom2d.Point x= "0.0" y="1.6"/> 
<com.dautelle.geom2d.Point x= "1.0" y="-1.0"/> 
</com.dautelle. geom2d.Polygon> 
<com.dautelle.geom2d.Area> 


Point [] vertices; 


public Attributes getAttributes() { 


return null; 


public Representable[] getContent() { 


return vertices; 


} 


<com.dautelle.geom2d.Ellipse width="1.0" height="20.0"> } 
public class Area extends Surface { 


<com.dautelle.geom2d.Point x= "0.0" y="0.0"/> 
</com.dautelle.geom2d.Ellipse> 
</com.dautelle.geom2d.Area> 
</com.dautelle.geom2d.Area> 


Listing Three 
<?xml version='1.0'?> 


<Area xmlns="java:com.dautelle.geom2d"> 
<Ellipse width="1.0" height="20.6"> 
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Surface [] surfaces; 


public Attributes getAttributes() { 


return null; 


public Representable[] getContent() { 


return surfaces; 
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A Java 2 Network 
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Dynamically 
loading classes 





Lorenzo Bettini and 
Donato Cappetta 


ava is useful for writing distributed ap- 

plications, thanks to its independence 

from the underlying operating system 

and computer architecture, not to 
mention all the classes that the Java library 
provides — especially those in the java.net 
package. Moreover, Java provides a means 
to synchronize multiple threads, which is 
useful in distributed applications. 

But when it comes to distributed ap- 
plications, one useful feature is that Java 
loads classes dynamically (that is, at run 
time, when they are needed), so it is pos- 
sible for a program to load classes that 
hadn’t even been created when the pro- 
gram was first written. Typically, Java au- 
tomatically loads classes dynamically, but 
the language also lets you write cus- 
tomized class loaders if you want to load 
classes from another source (instead of 
the ordinary paths in the CLASSPATH). 

In this article, we present Network- 
ClassLoader, which lets you load classes 


Lorenzo is a Ph.D. student in computer 
science at University of Florence, Italy. He 
can be contacted at bettini@dsi.unift.it. 
Donato is a software engineer at Intrage 
S.p.A., an Italian web company. He can 
be contacted at cappetta@infomedia. it. 
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from a (possibly remote) server. This serv- 
er can be seen as an application server 
that provides its applications to every client 
that requires them. We'll implement in Java 
a mechanism similar to those of the Java- 
enabled browsers. In this scenario, a client 


can require the main class for an appli- 
cation residing on a server. Then, when- 
ever a new class or a resource is needed 
by the application, it is automatically re- 
quested to the remote server, provided it 
is not present in the local file system of 
the client. Thus, when a new version of 
an application has to be installed, it will 
only have to be installed or updated in 
the server filesystem. 


How to Write Your Own Class Loader 

In his article “Java Custom Class Loaders” 
(DDJ, June 2000), Brian Roelofs present- 
ed specific steps for creating customized 
class loaders. Consequently, we will sim- 
ply provide an overview of the main parts 
of this process: Whenever a class A is 
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needed during the execution of your pro- 
gram (if it’s not already loaded), the class 
loader that first loaded the class that needs 
class A, say class B, is required to load 
class A. If your class loader has loaded 
class B, then it is up to your class loader 
to load class A and to return a Class ob- 
ject for that class. Actually, the classes that 
can be found in the local filesystem should 
be loaded by the ordinary Java class load- 
er, also called the “primordial class load- 
er.” It could be dangerous to load a class 
in a java.lang package from another source 
because those classes gain many access 
privileges. Moreover, loading two classes 
belonging to the same package from dif- 
ferent sources could cause some run-time 
errors (the main trick to show that “Java is 
not type safe” was to load classes com- 
piled in different moments that would 
cause a type error if loaded together). Ad- 
ditionally, loading the same class twice 
would produce a run-time error. So be- 
fore loading a class that is not found in 
the local filesystem, the class loader’s class 
cache should be checked to test if that 
class is already there; that is, if it has al- 
ready been loaded. In Java 1.1, it was your 
job to take care of all these problems. In- 
deed, you had to inherit from java.lang 
ClassLoader and implement the abstract 
method /oadClass, which is called when a 
class is to be loaded. This method has to 
perform the following steps: 


1. Check if the class has already been 
loaded by inspecting the class cache of 
the class loader. 

2. Check to see whether the class can be 
loaded by the system class loader. 

3. In case the previous tests have failed, 
try to load the class from a known 
source. 
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if (byte_code == null) 


} 


loader. 


(continued from page 58) 

4. Define the class; that is, create a Class 
object from the byte code (obtained at 
the previous step). 

5. Check whether the class has to be re- 
solved (inked), and in that case, re- 
solve it. 

6. Return the Class object to the caller. 


As you can see, step 3 is the one that 
you want to customize, while the other 
ones are standard (and, therefore, should 
be executed exactly that way to provide 
a correct and safe implementation). 

In Java 2, you are fortunately free of 
many of these burdens. The main part of 
the customization is not the loading of a 
class, but fetching the raw data to create 
a Class object from somewhere (a par- 
ticular location in the filesystem, for ex- 
ample, the Internet). In fact, the only 
method that has to be redefined is find- 
Class(String className), which is called 
if a class is found in neither the loader 
cache nor in the local filesystem (that is, 
by the primordial class loader). This 
method is an implementation of the pat- 
tern Template Method: The base class im- 
plements the main algorithm, allowing 
you to customize certain steps without 
having to worry about the algorithm it- 
self. Thus, Example 1 is a typical imple- 
mentation of this method. 

With this new model, you can also chain 
together many class loaders: The con- 
structor of ClassLoader can also be passed 
another ClassLoader that will become its 
parent class loader. Requests to load class- 
es are delegated to the parent. If the par- 
ent fails to load them, it will delegate them 
to the child by calling findClass. This could 
also be seen as an implementation of the 
Decorator pattern. findClass should always 
throw an exception if it cannot find the 
class so that another class loader can have 
a chance to find it. 


The Architecture of the System 
NetworkClassLoader is the client part of 
our client-server system, while the class- 
es are stored in and provided by the class 
server (ClassServer). 


e The server is basically a multithreaded 


server that continuously listens on a cer- 
tain port for incoming connections. 
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protected Class findClass (String className) throws ClassNotFoundException 
byte[] byte_code = loadBytesFromMySource (className) ; 


throw new ClassNotFoundException(className) ; 
return defineClass(byte_code, ...); 


Example 1: An implementation of the method findClass in a customized class 


Upon receiving a connection request, it 
spawns a concurrent thread, which will 
take care of that connection. It provides 
classes to the clients upon request. 

e The client instantiates a network class 
loader, specifying the Internet address 
(host and port) of the class server. 
Whenever a class that cannot be load- 
ed by the system class loader is need- 
ed, it will request it to the server. 


The classes of our system are contained 
in the package /oader. 


The Client 

The client program has to create a Net- 
workClassLoader (Listing One). It typical- 
ly loads the first class (which is not sup- 
posed to be in the local filesystem) 
through this loader’s method /oadClass 
and then it creates a new instance by us- 
ing the method newlnstance of the Class 
object returned by the loader: 


ClassLoader loader = new NetworkClass 

Loader(hostName, port); 
Class c = loader.loadClass(className); 
Object main = c.newInstanceQ); 


The Internet address of the remote 
ClassServer is passed to the constructor of 
NetworkClassLoader. className is a String 
that specifies the name of the class to be 
loaded. The object created by new/nstance 
is not assigned to a reference of class class- 
Name. This cannot be done because it is 
assumed that the class we are trying to 
load from the Internet is not present in our 
class path. In any case, it is still impossi- 
ble to assign such an object to a reference 
of the same type. You could force the load- 
er to load the class from the Internet, even 
if it's present in the local class path. But in 
this case, the class of the reference would 
be loaded by the system class loader (that 
is, from the local filesystem), while the ob- 
ject would be instantiated by a Class load- 
ed by NetworkClassLoader. The problem 
is that two classes loaded by different class 
loaders are incompatible (due to security 
reasons, these classes will belong to dif- 
ferent namespaces). 

If using an Object is too restrictive (be- 
cause even after the instantiation, it is nec- 
essary to set some attributes of the object), 
you could use a common superclass (or an 
interface). For instance, if there is an inter- 
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face, AppFoo, that is present in the local 
filesystem, and the class that we are going 
to load from the Internet, say RealApp, is 
known to implement this interface, it would 
then be possible to execute this code: 


Class c = loader.loadClass('RealApp"); 
AppFoo app = (AppFoo) c.newInstance(); 
app.setTitleC'My Application"); 
app.openFile("foo.txt"); 


app.run(); 


In fact, AppFoo (class or interface) 
would be loaded by the system class load- 
er, So it would be common both to the 
reference and to the object created 
through the Class object returned by Net- 
workClassLoader. 

We'll now examine the method /ind- 
Class. In our implementation, the byte ar- 
ray with the contents of a .class file is 
downloaded from the network, and so the 
method findClass (see Listing One) looks 
something like this: 


if (connected) 
connect(); 
classBytes = 
loadClassFromServer(className); 
classClass = 
defineClass(className, classBytes, 
0, classBytes.length); 
return classClass; 


The first time, a connection is estab- 
lished (method connect) with the 
ClassServer. To request a class’s byte code 
to the server, we send a Resourcekequest, 
through serialization, with the name of 
the class. The bytecode (byte array) will 
be returned in a ResourcePacket. If an 
error occurs in the server, it is commu- 
nicated in the response packet. Then the 
byte array is transformed in a Class ob- 
ject by calling the method defineClass, 
implemented in the superclass. 


The Server 

ClassServer is essentially a multithreaded 
server that listens for incoming connec- 
tions on a predefined port. Every time a 
connection request is received, a new 
thread (WorkerClassServer) is spawned, 
and the server keeps on listening for con- 
nections. Thus, many clients can be 
served concurrently. In fact, every client 
will be served by a distinct Worker- 
ClassServer. WorkerClassServer (available 
electronically; see “Resource Center,” 
page 5) reads a Resourcekequest from the 
socket connected to the client, loads the 
contents of the .class file (method get- 
ClassBytes) for the requested class, and 
then it sends back a ResourcePacket with 
the byte array (the contents of the file). 
The name of the class is specified in the 
standard Java format (package names are 
separated by dots), but it is transformed 
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Many companies take a piecemeal approach to managing their 
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resulting scenario can be a nightmare, with team members 
drifting further and further apart as their development task 
progresses. But Manfred 
Heiss, manager of product 
development for IXOS 
Software AG, has found a 
solution. The Perforce 
Software Configuration 
Management (SCM) system 
offers IXOS a powerful 
yet easy-to-use system 
for tracking and managing 


“Perforce is very 


reliable and stable, 
which is absolutely 
necessary for us, because 


275 developers depend 

; 7 the evolution of a soft- 
on this system. ware product — no matter 
how large, and no matter 
how many developers are 


working on it. 


—Manfred Heiss 


Why IXOS needed an SCM system 

IXOS Software AG is the worldwide leading provider of SAP™ 
R/3 software for document management. The company develops 
software in Germany, California, and the Czech Republic, and 
has teams working on Windows, UNIX, and Macintosh 
computers. According to Heiss, "At present, IXOS provides five 
products, each with more than one new version a year. This adds 
up to more than 120,000 files and several million lines of code." 
Developers at IXOS needed a system to synchronize their 
development teams. They required an SCM tool that could scale 
with their company's growing product line. Heiss says, "The 
whole selection [process] took us about five months of hard 
work. But it was worth it." From its start in a single 20-member 
team, Perforce is now used by more than 275 developers, testers, 
and writers at IXOS. 


How they got started 

IXOS purchased 150 licenses for its Munich development team. 
One of the company's most experienced developers, Gabriel 
Grebenar, learned to use the Perforce system, installed a new 
SUN UltraSPARC as the Perforce server, and over the course of 
a single Sunday afternoon transferred his team’s code from 
their existing file version management system. He then led his 





team in a two-hour training class and offered weekly half-hour 
question-and-answer workshops. In just one month, all team 
members were experienced Perforce users. 


“From that two-hour class, our first team used Perforce with 
great success," Heiss says. “Everyone on the team was delighted 
with Perforce for its pleasant GUI and its increased speed over 
any system they had ever used.” 


How they grew 

Using the same successful training procedure, IXOS soon 
introduced four more development teams to Perforce. “After 
that, all of the rest of our developers became curious and wanted 
to migrate their projects to Perforce as soon as possible. We had 
no problem introducing Perforce or winning acceptance for the 
new SCM system," Heiss says. Within 10 months of its first 
introduction of Perforce, IXOS purchased an additional 125 
licenses for its branch offices. 


How they configure Perforce 

All of the development teams work on a single Perforce server 
(SUN UltraSPARC), connected by fiber optics to another identical 
workstation and mirrored once a day to protect from failure. 
Some developers work part-time from their home offices, 
connecting to the server over ISDN phone lines using the 
Perforce compression mode. These workers have no problem 
connecting to the system, and are very satisfied with it. 


Why they like it 

"Perforce is very reliable and stable, which Is absolutely necessary 
for us, because 275 developers depend on this system,” Heiss 
concluded. "We didn't have a single day of downtime during the 
last year, and we made only one support call. This is a great 
improvement over every single SCM system we've used before.” 
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Headquarters: Munich, Germany / Industry: Software 
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Perforce software products used: Perforce 99.1 

Type of machines: SUN UltraSPARCs as Perforce servers, Windows NT, UNIX, and 
MacOS as Perforce clients 

Type of network: 10 and 100 Mbps 

Number of users: 275 (a mixture of developers, testers, and 
documentation writers) 

Number of development sites: Main site in Munich, Germany; branch offices in 
Prague, Czech Republic; Leipzig, Germany; and San Mateo, California 

Number of files: 120,000 / Languages used: C, C++, Java, etc. 

Perforce customer since: February 1999 
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in a local filesystem path by using the 
underlying operating-system file path sep- 
arator (obtained through the property 
file.separator). 

A cache of .class file contents is shared 
by all WorkerClassServer, so if a class is 
requested by two clients, it is loaded from 
the filesystem only once. 

The file is searched in the local class 
path by calling the static method Class- 
Loader. getSystemResourceAsStream(class- 
Name), which returns an InputStream as- 
sociated with the file searched in the paths 
specified in the class path. 


Resources 

Loading classes is not the only responsi- 
bility of a class loader— it also locates 
resources requested by classes. These re- 
sources are typically image files, audio 
files, or everything a class requests 
through the method Class.getResource- 


oleae a! 


(String name), which returns a URL ob- 
ject. For instance, you can load a GIF im- 
age for an icon like this: 


ImagelIcon image = new Imagelcon(get 
Class().getResource('openFile.gif")); 


Or you can play a .au file like this 


AudioClip audioClip = 
Applet.newAudioClip(getClass(). 

getResource("'spacemusic.au")); 
audioClip.playQ; 


The resource name is automatically 
changed by prepending the package 
name and converting every “.” into “/”. 
Quoting from the Sun Java documenta- 
tion: “The rules for searching resources 
associated with a given class are imple- 
mented by the defining class loader of 
the class.” After the modification to the 
name of the resource, Class.getResource 
invokes the homonymous method of its 
own class loader. Instead of redefining 
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Figure 1: The application frame and two terminals. The upper one is executing 
the server, the other is executing the loader. 
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this method, the delegation model is 
once again used. If the resource is not 
found by the parent class loader, the 
method findResource(String name) is in- 
voked. This is the method that the cus- 
tomized class loader has to redefine in 
order to find the resource and return a 
URL for it. 

In our implementation, the resource is 
requested to the class server (again by 
means of a ResourceRequest), and if ob- 
tained, it is saved on the local file system 
as a temporary file. Then the URL of the 
newly stored resource is returned so that 
the parent class loader is able to return 
the resource to the class. Even in this case 
a table of resources is kept so that an al- 
ready stored resource is not requested 
again to the server. As we use the method 
File.createTempFile, those temporary files 
will be stored in the system’s temporary 
file directory and automatically removed 
when the JVM terminates. 


Requesting and Getting a File 

The loader will request a class or a (bi- 
nary) resource to the server by means of 
Resourcekequest, and will receive the re- 
sponse (and possibly the byte array) by 
means of ResourcePacket. Upon request- 
ing a file, the type of the file (only CLASS 
or BINARY in our implementation) has to 
be specified. This differentiation is nec- 
essary because the server translates class 
file paths by replacing “with “/” and by 
appending “.class,” while resource paths 
are left unchanged (the right path is al- 
ready provided by Class.getResource). If 
there is a problem, the class server may 
specify the error in ResourcePacket (for 
instance, the requested class or resource 
is not available on the server, or the re- 
quest is not valid). 


An Example 

To test NetworkClassLoader, we've pro- 
vided a TestApp in the package loader- 
app. This is a simple application (using 
the Swing library) that uses images and 
sounds. To test the application on a sin- 
gle machine, you need to locate the load- 
erapp package where the class server can 
find it, but the client cannot. For instance, 
the loader classes can be put in a direc- 
tory in the CLASSPATH (say, “myclasses”), 
while the loaderapp classes are put into 
a temp directory (say, ‘temp”). By mak- 
ing sure that the current directory (.) is in- 
cluded in the CLASSPATH, you can launch 
the class server from the temp directory 
and the client from a different directory. 
Images and audio files have to be placed 
in the same place where the .class files of 
loaderapp are placed. Two classes are 
provided to launch the class server and 
NetworkClassLoader, respectively: load- 
er.RunServer and loader.RunLoader. The 
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server can also be passed the port number 
for incoming connections. RunLoader is 
simply an application that takes the address 
and port number of the server and the 
name of the class to be loaded as param- 
eters. It creates a NetworkClassLoader, and 
once the class is loaded, an instance of that 
class is created. Obviously, the Network- 
ClassLoader could be instantiated directly 
in your own application. (Additionally, 
many NetworkClass Loaders, communicat- 
ing with different servers, could be instan- 


For example, the server could be start- 
ed like this: java loader.RunServer 9999. 
The loaderapp.TestApp could be loaded 
(and thus started) by this: java loader.Run- 
Loader localhost 9999 loaderapp.TestApp. 
The loader prints all the classes that are 
loading on the screen (including system 
classes), and it specifies which ones are 
loaded from the server; the loader does 
the same with resources. The class server, 
on the other side, prints all class and re- 
source requests. You will notice that both 
classes and resources are requested only 
once to the server. Moreover, the loader 


will print class names using indentation, 
thus showing that loading a class A may 
require loading many other classes before 
the loading of A is completed. Figure 1 
shows the application frame and two ter- 
minals: The upper one is executing the 
server, and the other is executing the load- 
er. The printing of class and resource 
names is the only reason why we also re- 
define the method JoadClass. 
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Listing One 


package loader; 


import java.io.*; 
import java.net.*; 
import java.util.Hashtable; 


public class NetworkClassLoader extends ClassLoader { 


private String hostName = null; 

private int serverPort; 

private Socket socket = null; 

private ObjectInputStream is = null; 

private ObjectOutputStream os = null; 

private boolean connected = false; 

private int tab = -1; // just to print with indentation 


private Hashtable resourceTable = new Hashtable(); // key name, value File 


public NetworkClassLoader() { 
this("localhost", 5859); 


public NetworkClassLoader (String hostName, int serverPort) { 
super (); 
this.hostName = hostName; 
this.serverPort = serverPort; 


protected Class findClass(String className) 
throws ConnectClassServerException, JavaPackageException, 
ClassNotFoundException, ClassFormatError { 
byte[] classBytes = null; 
Class classClass = null; 
// try with the network server 
try { 
// connect to the ClassServer 
if (!connected) 
connect () ; 
classBytes = loadClassFromServer (className) ; 
} catch (IOException ioe) { 
disconnect () ; 
throw new ConnectClassServerException(ioe.toString()); 
} 


// convert the byte array into a Class and put it in the cache. 


classClass = defineClass(className,classBytes,0,classBytes.length) ; 


if (classClass == null) 
throw new ClassFormatError (className) ; 
Print(className + " loaded from the SERVER") ; 
return classClass; 
} //end loadClass() 
protected URL findResource(String name) 


URL resourceURL; 
try t 
File localResourceFile = (File) resourceTable. get (name) ; 
// we have to download it 
if (localResourceFile == null) { 
Print ("findResource: " + name + " at the SERVER"); 


byte[] resourceBytes = loadResourceFromServer(name, "BINARY") ; 


if (resourceBytes == null) { 
Print("Resource " + name + " not found on server!"); 
return null; 


localResourceFile=createLocalResourceFile (name, resourceBytes) ; 


resourceTable.put(name, localResourceFile) ; 
Print("stored locally: " + localResourceFile) ; 
J 
return getLocalResourceURL(localResourceFile) ; 
} catch (Exception e) { 
Print("Exception " + e); 
} 


return super.findResource (name) ; 


protected URL getLocalResourceURL(File file) throws MalformedURLException 


return file.toURL(); 


protected File createLocalResourceFile(String name, byte[] bytes) 
throws MalformedURLException, FileNotFoundException, IOException 
{ 


File resFile = File.createTempFile 
("__temp_res_", "_" + createLocalResourceName(name)) ; 
resFile.deleteOnExit(); 


FileOutputStream fostream = new FileOutputStream(resFile) ; 
fostream.write(bytes, 0, bytes.length) ; 
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fostream.close(); 
return resFile; 


protected String createLocalResourceName(String name) 
return name.replace('/', '_'); 


public synchronized Class loadClass(String name, boolean resolve) 
throws ClassNotFoundException 


{ 
++tab; 
Print ("Loading class " + name); 
Class result = super.loadClass(name, resolve) ; 
-—babs 
return result; 
} 


protected void Print(String s) 


for (int i = @; i < tab; ++i ) 
System.out.print(" "); 
System.out.println(s) ; 


protected void connect() throws UnknownHostException, IOException { 
System.out.println("Connecting to the ClassServer " 
+ hostName + ":" + serverPort) ; 
socket = new Socket (hostName, serverPort) ; 
connected = true; 
os = new ObjectOutputStream(new 
BufferedOutputStream(socket.getOutputStream())); 
os.flush(); 
is = new ObjectInputStream(new 
BufferedInputStream(socket.getInputStream())); 


System.out.println("Connected") ; 
} //end connect () 
protected void disconnect () { 
try { 
connected = false; 
os.close(); os = null; 
is.close(); is = null; 
socket.close(); socket = null; 
} catch (Exception e) { 
e.printStackTrace(); 
E 
} //end disconnect () 
protected byte[] loadClassFromServer (String className) 
throws ClassNotFoundException, SocketException, IOException 
{ 
byte[] classBytes = loadResourceFromServer(className, "CLASS") ; 
if (classBytes == null) 
throw new ClassNotFoundException (className) ; 
return classBytes; 
} //end loadClassFromServer () 


protected byte[] loadResourceFromServer(String resourceName, String type) 


throws FileNotFoundException, ClassNotFoundException, 
SocketException, IOException 


byte[] fileBytes = null; 
// load the file data from the connection 


// send the name of the file 
sendRequest (resourceName, type); 


// read the packet 
ResourcePacket resourcePacket = (ResourcePacket) is.readObject(); 


if (! resourcePacket.isOK()) 
throw new FileNotFoundException(resourcePacket.getError()); 
fileBytes = resourcePacket.getResourceBytes(); 
return fileBytes; 
} //end loadResourceFromServer () 


protected void sendRequest (String name, String type) 
throws IOException 


os.reset(); 
os.writeObject (new ResourceRequest (name, type)); 
os.flush(); 


//end class NetworkClassLoader 
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abrication advances are causing most 

processors to move from 32-bit words 

to 64-bit words. So what good are 64- 

bit machines? One advantage is that 
they can address more than 4 GB of mem- 
ory. Another is that they can operate on 
64 bits of data at once. The data need not 
be a single 64-bit integer; it could be eight 
characters. Listing One shows an exam- 
ple implementation of the C function 
strlen, which returns the offset of the first 
Q byte in a string. It is tailored to Big- 
endian 64-bit machines. The gimmick is 
this line: 


b = (w&-m) + ((w&m)>>7)) & m; 


that looks for the first “all ones” byte in 
w. It does this by adding the high-order 
bit in each byte to the 7 low-order bits, 
and looking for a carry into the high- 
order bit. Hence 8 bytes are processed 
in parallel on each loop iteration. Once 
it has found such a carry, the code per- 
forms a series of shift and mask opera- 
tions to identify the leftmost such carry; 
see the accompanying text box entitled 
“Formula for Repetitive Constants.” 


Arch is the lead developer for KAI C++. He 
cut his 64-bit teeth on the nCUBE-2 pro- 
cessor in 1990. He can be reached at arch 
robison@intel.com. 
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Moving Up to 64 Bits 


The disadvantage of the technique is 
the extra startup overhead for alignment 
considerations, and the fact that it needs 
more instructions per loop iteration. Thus, 
for short strings and 32-bit processors 
(which yield only four-way parallelism), 
it rarely pays off. Furthermore, some 





newer processors have multimedia 
instructions for parallel operations on 64 
or even 128 bits without the need for the 
carry trickery in the example. 

The choice of strlen is for illustrative 
purposes— most modern compilers and 
hardware specifically optimize the native 
version of strlen so well that the parallel 
version may well be slower on your sys- 
tem these days. 

As you might expect, there are a num- 
ber of issues involved in migrating from 
32 to 64 bits. In this article, I’ll describe 
some of the migration issues that I have 
found to be significant. 

Java solves the portability problem by 
creating a virtual machine model every- 
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where that promises (or at least advertis- 
es) identical results for sequential pro- 
grams. (Parallel programs are often nat- 
urally nondeterministic.) Doing so dumbs 
all machines down to the lowest common 
denominator. Java’s Procrustean approach 
also prevents programs from growing with 
their machines. For example, Java limits 
arrays to about 2 billion elements. This 
makes sense on 32-bit toaster ovens, but 
is a limitation on 64-bit machines. 

There are many theoretically possible 
problems in the migration from 32 to 64 
bits. Pll stick to the ones that I’ve seen 
happen in practice, starting with the com- 
mon language-independent issues, and 
then moving on to those peculiar to C/C++. 

Perhaps the biggest hassle with migra- 
tion from 32-bit to 64-bit platforms is not 
the details of coding— it’s the fact that 64- 
bit binaries won’t run on 32-bit platforms. 
Thus to exploit the 64-bit capabilities, 
you'll end up shipping both 32-bit bina- 
ries and 64-bit binaries, which drives up 
production and testing costs. I know of 
no panacea for this. It’s a cost of progress. 


Layout Issues 

Obviously, 64-bit addresses and integers 
take up twice as much space as their 32- 
bit counterparts. The doubling can slow a 
program down; half as many integers or 
pointers can fit in cache at once. Thus big- 
ger is not necessarily better. 

The size of structures is not only af- 
fected by the size of fields, but also of 
their alignment constraints. On 64-bit ma- 
chines, the alignment of Jong may be 
stricter than for int or float, it may be re- 
quired to reside at a multiple of 64 bits 
instead of 32 bits, and likewise for type 
double. Consequently, structures con- 
taining type double may grow about 33 
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(continued from page 66) 

percent larger than their size on a 32-bit 
machine, even though the sizes of the 
fields are unchanged because the com- 
piler must insert more padding in the 
structure to guarantee alignment require- 
ments. For instance: 


struct Foo { 
double x; // 8 bytes 
bool y; // 1 byte 


// Implicit padding here. 


requires 12 bytes for 32-bit alignment, but 
16 bytes for 64-bit alignment. (Coding tip: 
For good packing of structures, declare 
fields in decreasing order of size.) 


Numerical Issues 
Sixty-four-bit integers have much more 
range. Whereas the maximum value for a 
signed 32-bit integer is about 2 billion, the 
maximum value for a signed 64-bit inte- 
ger is about 9.223x10}8. I recommend 
learning the order of magnitude and the 
first few leading digits of this number, and 
likewise for the maximum for an unsigned 
64-bit integer (about 18.44x10”). It’s handy 
to be able to recognize them in decimal 
dumps, particularly the unsigned value, 
since it corresponds to a word of all ones. 
While most problems in migrating to 
64-bit platforms arise from the extra high- 
order bits, the low-order bits can cause 
trouble too. Both 32- and 64-bit machines 
typically use IEEE floating point. An IEEE 
double- precision floating- pointer number 
(typically type double in C) has 64 bits, 
but 12 of those bits hold the sign and ex- 
ponent. Converting a 64-bit integer to a 
double can cause a loss of precision, which 
was never a problem with 32-bit integers. 


Avoiding C/C++ Traps 

C/C++ sometimes have a bad reputation 
for portability. In fact, it is usually straight- 
forward to write portable C/C++. Think 
of the machine architecture as one of the 
inputs to your program, and parameter- 
ize your program accordingly. 

When writing portable code, it is im- 
portant to understand the difference be- 
tween a language standard and an im- 
plementation. For some _ proprietary 
languages, there is no difference: The im- 
plementation defines the Standard. But 
for many languages (for instance, C, C++, 
Fortran, Ada, Cobol) there are interna- 
tional Standards that say what is guaran- 
teed and what is not. For example, giv- 
en the statement: 


alj++]=blj++] 
some implementations of C increment the 
value of 7 twice. However, the ISO stan- 


dard for C says that the effect on / is un- 
defined. Some of the problems that I will 
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point out are actually latent violations of 
the C/C++ Standards that are exposed by 
the migration to 64-bit platforms. To im- 
prove the chances that your compiler 
catches the problems for you, I strongly 
recommend that you turn on all the type- 
checking available. 

Adhering to the Standards and param- 
eterizing your code appropriately are 
good, but only up to a point. You have 
to draw a line somewhere. In principle, 
machines can be really weird and still have 
implementations of C/C++ that conform 
to ISO standards. For example, 13-bit char- 
acters and one’s complement arithmetic 
are theoretically possible. But practically 
speaking, except for DSP chips and an- 
tiques, machines have 8-bit characters and 
two’s complement integers with a size that 
is a power of two. Writing excessively 
general code can be just as much a waste 
of time as writing excessively narrow code, 
so give heed to Samuel Johnson’s advice 
before coding: “No man but a blockhead 
ever wrote except for money.” 

Here are the three principle sources of 
information for parameterizing your code 
for word size. The first is the sizeof oper- 
ator. Use it religiously to compute the sizes 
of types, not only when allocating objects, 
but also when packing or unpacking bits 
in integers. The second is <limits.h>. It de- 
fines CHAR_BIT, the number of bits in 
type char, and the maximum and mini- 
mum values for integral types; for exam- 
ple, ULONG_MAX for the maximum val- 
ue of an unsigned = long. The third is the 
integral types themselves. Choosing the 
right type for a purpose is part of param- 
eterizing your code for word size. 


long and inf: Noticeably Different 

The most frequent gotchas in 64-bit code 
arise not from bigger absolute sizes of 
types, but from changes in relative sizes: 
Types that were the same size on 32-bit 
systems are often no longer so on 64-bit 
systems. Indeed, this was the major stum- 
bling block in the migration from 16-bit 
to 32-bit systems. In particular, modern 
32-bit code sometimes presumes, often 
accidentally, that sizeof(int)==sizeof(long) 
==sizeof(pointer), which is almost never 
true on 64-bit machines. 


long Bigger Than int 
On 64-bit platforms, it is usually true that 
sizeofVlong)>sizeof(int). The exceptions 
are cretinous implementations with 32-bit 
long and int, and the long long abomina- 
tion for 64-bit integers. The rest of this ar- 
ticle presumes a clean 64-bit system: 32 
bits for int, 64 bits for Jong and pointers. 
The growth of Jong to bigger than int 
can cause some subtle errors in programs. 
The most common program error is acci- 
dental truncation, as shown here: 
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long x = 0; 
for( int k=0; k<sizeofdong)*CHAR_BIT; 
++k ) 
x |= 1<<k; 


This code appears to set all bits in x to 
1, and does so when sizeoft(long)==sizeof- 
(int). However, when sizeofVong)>size- 
ofCint), there is a problem: The 1 has type 
(int), and thus the shift is done using int 
arithmetic, not Jong arithmetic. When the 
shift count is greater than or equal to the 
number of bits in the type, the ISO rules 
say the result is undefined. For instance, 
one common platform simply ignores all 
but the rightmost 5 bits in the shift count, 
and thus evaluates 1<<32 the same as 
1<<0, which is 1, not the intuitively ob- 
vious 0. 

The correct way to write the affected 
line is: 

x |= IL<<k 


or, if writing a template and x is of some 
unknown type 7, write: 


x |= T(1)<<k 


Be on the lookout for similar truncation 
problems with other operators. I know of 
no good automatic defense against the 
problem. 

While on the subject of bit manipula- 
tion, there is one other gotcha for 64-bit 
systems: You need 6 bits, not 5, to spec- 
ify the index of a bit within a word. Bit 
fields that hold bit indices must be de- 
clared with 6 bits instead of 5. Otherwise, 
the critical 6th bit of such indices is silent- 
ly lost. 


Pointers Bigger Than int 

Pointers and type int usually have the 
same size on 32-bit platforms these days, 
but not on 64-bit platforms. There are 
two common kinds of mischief (or de- 
served punishment) arising from this 
change. 

First, C permits conversions between 
pointers and integer, and regrettably some 
programs habitually do so. If the con- 
version is between a pointer and type 
int, the code will likely break on a 64- 
bit platform because the 32-bit int will 
not hold all the bits that were in the 
pointer. 

C++ helps a bit in this matter— it ex- 
pressly disallows converting a pointer to 
an integer with fewer bits. Given a point- 
er p, the expression (int)p might compile 
without complaint on a 32-bit platform, 
and be rejected by a compiler for a 64-bit 
platform. Hence, if you must cast between 
integers and pointers, it is probably safest 
to use type Jong for the integers. 

Second, the difference in sizes for point- 
ers and int exposes the difference between 
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adding a signed or unsigned int to a point- 
er. Consider: 


char * p; 
iat i 
unsigned int u; 


Does p+(uti) point to the same memory 
location as (p+w)+i? That is, is addition as- 
sociative? Let’s plug in some numbers to 


External file 
formats can cause 
eternal portability 


grief 


find out. Let uw=—1 and i==1. The first ex- 
pression simplifies to p+(unsigned)O. The 
second expression is most likely illegal on 
most 32-bit machines, because p+u is out 
of bounds unless the object pointed to by 
p occupies all memory. But most hardware 
will happily execute it, and once 7 is added, 
it will wrap around to yield the value of 
Dp, a valid address. Thus the error goes un- 
noticed. But on a 64-bit machine, (p+w)+1 
simplifies to a pointer 4 GB away from p, 
which is either an illegal address (result- 
ing in a machine fault when dereferenced), 
or a legal address in a really big object 
(which is even worse if the intent was ad- 
dress p, because it silently works). 

A related problem involves changes in 
the relative sizes of size_t and ptrdiff_t 
compared to int. Typically, type size_t is 
32 bits, is on a 32-bit machine, and 64 bits 
on a 64-bit machine. Furthermore, size_t 
is often unsigned int on 32-bit machines, 
and unsigned long on 64-bit machines — 
a change in type as well as size. Type 
ptrdiff_t usually changes likewise, except 
that it is always a signed integral type. 


Choosing Integral Types 

Here are my rules of thumb for picking 
integral types: If a value fits in 8 bits (and 
space is at a premium), use an explicitly 
signed or unsigned char. Be leery of us- 
ing a plain char for anything other than 
ASCII because the standards permit im- 
plementations to treat plain char as un- 
signed or signed. If a value fits in 16 bits, 
use a short or unsigned short. If a value 
needs to be handled fast and fits in 16 bits 
(in critical loops, for instance), use type 
int or unsigned int. If a value needs at 
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least 32 bits, use type Jong or unsigned 
long because the C/C++ Standards guar- 
antee only at least 16 bits for int and un- 
signed int. If a value needs to be bigger 
than 32 bits and you might be running on 
a 32-bit machine, use some sort of pack- 
age for multiprecision arithmetic. In C++, 
overloaded operators make such packages 
painless to use. 


Formatting and 1/0 

The switch from 32 to 64 bits can intro- 
duce formatting and I/O problems. The 
problems arise from both increases in 
magnitudes and from changes in the rel- 
ative sizes. Consider the following exam- 
ple, which has both sorts of mistakes: 


long j=... 
char buffer[12]: 
sprintf( buffer, "%d", j ); 


There are two problems here when a /ong 
grows from 32 to 64 bits. First, more dig- 
its are necessary to print numbers. The ar- 
ray size 12 was big enough to hold an op- 
tional minus sign followed by 10 decimal 
digits and a null terminator. The switch to 
a 64-bit Jong potentially adds nine more 
digits. Table 1 lists the digit requirements. 

Second, the format string has a subtle 
error, even though it works when size- 
oflong)==sizeof(inv). The format for print- 
ing a Jong requires an / modifier: It should 
be %ld, not %d. On a machine with size- 
of(long)>sizeoflint), such an error may 
cause the value to be printed incorrectly, 
or for formats that print multiple values, 
errors in printing the other values because 
the format string “lies” about parameters 
that were passed. Always remember the / 
modifier when printing a /ong. Here’s the 
corrected example: 


long j=... 
char buffer[21] 
sprintf( buffer, "ld", j ); 


Also note that type size_t may be an un- 
signed int or unsigned long. Thus when 
printing a size_t, cast it to an unsigned long 
and include the / modifier, as shown here: 


size_t S; 
printf("size=“%ld\n", (unsigned long)s ); 


The best solution is to use C++ and 
std::ostringstream — then you are not re- 
sponsible for preallocating storage or the 
/ modifier. 

External file formats can cause eternal 
portability grief since they become set in 
stone. Think hard about the following when 





Table 1: Digit requirements for 
migration. 
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(continued from page 70) 
inventing a new file format. The common 
sin is writing and reading raw structs to and 
from files. While such code is easy to write, 
it locks the file format to the particular struc- 
ture layout and byte order of a machine. 
The simplest portable approach is to al- 
ways write and read bytes— not words. 
There is a format used by the DWARF de- 
bugging format that is an excellent way 
to write and read integers in a machine- 
independent fashion. The format writes 
out an Bee integer 7 bits at a time, 


starting with the least-significant 7 bits. 
The upper bit of each byte is used as a 
delimiter: 0 means end of integer; 1 means 
another byte follows. Not only is this for- 
mat machine independent, it compresses 
small integer values. Thus, the decrease 
in I/O traffic can easily make up for the 
added time for encoding and decoding, 
particularly in the current milieu where 
processors grossly outrace storage devices. 

For example, unsigned integers between 
Q and 127 take up only a single byte. There 
is a similar format for signed seal that 


Formula — je Con: stants, 


ee ea 
peats every few | J 


dious to write, and i it's ‘easy to miscount 
digits. The following formula is handy 
for writing such a constant of type Jong 
that repeats a value K every N bits, 
when the word size is a multiple of N: 
~OUL/((1<<N)- 1) #K. 





differs only in that when read, the 6th bit 
of the last byte is treated as a sign bit. List- 
ing Two shows some concise C++ template 
functions for such encoding and decoding 
for both the signed or unsigned format. 

Of course, sometimes efficiency or 
simplicity constraints dictate that ma- 
chine integers be written out directly. If 
you need to do so, I recommend at least 
inventing a file header that describes the 
number of bytes in an integer, and the 
byte order. One gimmick for describing 
byte order is to write a number such as 
“0x0706050403020100” as a raw integer at 
the beginning of the file. On a Big-endian 
machine, the resulting byte sequence is 
“76543210,” ona Little-endian ma- 
chine it is “O 123 45 6 7.” The byte with 
value 1 describes exactly where to find 
the mth most significant byte. 


Conclusion 

The migration from 32-bit machines to 64- 
bit machines is really a test of how clean 
your code is— does it work accidentally 
or according to standards? Do it right, and 
youll be able to brag to your grandchil- 
dren when they are groaning over the 128- 
bit to 256-bit migration. 


DDJ 
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Listing One 


// Big-endian code. Presumes 8-bit bytes and 64-bit type long. 
size_t strlen( const char * s ) { 
typedef unsigned long word; 
// Align on word boundary 
// Presumes that sizeof(word) is power of 2. 
const char * t = g; 
for( ; (unsigned long)s & sizeof(word)-1; ++t ) 
if( !*t ) 
return t-s; 
// Search for word containing a byte equal to zero. 
word b; 
const word m = 
const word * u 
do { 
word w =~*utt; 
// For each byte in b, set high bit of byte if corresponding byte 
// inw is a zero. Do this by adding high bits of each byte to 
// other 7 bits of each byte, and look for carry into high bit. 
// There are many other possible sequences -- the best depends 
// upon your compiler and target machine. 
b = ((w&~m) + ((w&m)>>7)) & m; 
} while( !b ); 
t = (char*) (void*) (u) ; 
// t now points to word *after* the word with the zero. 
// Binary search b. for position of high-order 1. 
// The logic here is for big-endian machines with 64-bit words. 
if( b&@xFFFFFFFFOOOGOOOGUL ) { 


((word)-1 / OxFF)<<7; 
= (word*) (void*)t; 


b >>= 32; 
t -= 4; 
} 
if( b&@xFFFFO@OOUL ) { 
b >>=16; 
t -= 2; 
} 
if( b&@xFFOOUL ) { 
b >>=8; 
t -= 1; 
} 
return (char*) (void*)t-s-1; 
} 
e e 
Listing Two 


// The functions below are written as templates so that they 

// employ the signed or unsigned format according to whether 

// type T is a signed or unsigned integral type respectively. 

// The idiom "-(T)1>@" is a way of asking "is T an unsigned type?" 
// For simplicity, the routines below presume that when type T 

// is a signed integral type, operator>> does sign extension. 


side compone . bro ndep: | 
on-the-fly. Works within single server or web farm environments. 


// This is widely common behavior, but not mandated by ISO C/C++. 
typedef unsigned char byte; 


// Template function "encode" writes the encoding of value i to 
// the sequence starting at p, and returns an iterator that points 
// to the end of the sequence. 
template<typename Iterator, typename T> 
Iterator encode( Iterator p, Ti) f{ 
// If T is unsigned type, loop while u is outside range @..127 
// If T is signed type, loop while u is outside range -64...63. 
for(; (-(T)1>@ ? i : i+64u)>127u; >>=7 ) 
«p++ = i&127/128; 
xp++ = 18127; 
return p; 


// Template function "decode" sets "result" to the value decoded 
// from a byte sequence starting at p, and returns an iterator 
// that points to the end of the sequence. 

template<typename Iterator, typename T> 

Iterator decode( Iterator p, T& result ) { 


byte c; 
// Accumulate value in u from bytes until terminator is found. 
do { 


c = *ptt; 
u j= (c&127)<<s; 
s t= 7; 


} while( c&128 ); 

// Subtract ® or 1 depending upon bit 6 of last byte. 
// This has the effect of sign-extending u. 

result = -(T)>@ 2? u: u- (c>>6&<<s); 

return p; 


DDJ 








C# Versus Java 





Do we really need 
another language? 





Marc Eaddy 


icrosoft describes C# (“C sharp”) as 
a “simple, modern, object-oriented, 
and type-safe programming lan- 
guage derived from C and C++.” 
That statement would apply equally well 
to Java. In fact, after comparing the two 
languages, it’s obvious that prerelease de- 
scriptions of C# resemble Java more than 
C++. As Example 1 illustrates, the language 
features and syntax are similar. Example 
1(a) is the canonical “Hello World” pro- 
gram in Java, while Example 1(b) is the 
program in C=. 

But the resemblance goes beyond syn- 
tax, keywords, and delimiters. It also in- 
cludes features that Java and C# hold in 
common, such as: 





e Automatic garbage collection. 

e Reflection for type information discovery. 

e Source code is compiled to an inter- 
mediate bytecode. 

e Just-in-Time (JIT) compilation compiles 
bytecode into native code. 

e Everything must be in a class— no glob- 
al functions or data. 

¢ No multiple inheritance, although you 
can implement multiple interfaces. 

e All classes derive from Object. 

e Security for restricting access to re- 
sources. 

e Exceptions for error handling. 

e Packages/namespaces for preventing 
type collision. 

e Code comments as documentation. 

e Arrays are bounds checked. 

e GUI, networking, and threading support. 

e No uninitialized variables. 


Marc is a project leader developing real- 
time stock market applications at ILX Sys- 
tems. He can be contacted at me133@ 
columbia.edu. 
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e No pointers. 
¢ No header files. 


C#: The Evolution of Visual J++ 

Why does Microsoft think we need an- 
other language? When Microsoft intro- 
duced Visual J++ in October 1996, it threw 
lots of resources into the project. Their ef- 
forts produced the fastest JVM on the mar- 
ket and the Windows Foundation Classes 
(WFC), a set of Java classes that wrapped 
the Win32 API. Not coincidentally, Anders 
Hejlsberg, the project leader for WFC (and 
most famous as the author of Turbo Pas- 
cal), is the chief architect for C=. 





Microsoft decided to make changes to 
Java to integrate it more closely with Win- 
dows. Some of the changes — interfacing 
seamlessly with COM, refusing to support 
RMI and JNI, and adding delegates — 
caused it to break compliance with the 
Java Standard. Consequently, Sun Mi- 
crosystems sued Microsoft in October 1997 
for violating its Java license agreement. 
This doomed Microsoft’s future develop- 
ment of Java and Visual J++. However, Mi- 
crosoft decided to take its advances in the 
Java language, Java compiler, and JVM and 
morph them into an even more ambitious 
project — Microsoft .NET. 


Microsoft .NET 
The term “Microsoft .NET” is similar to 
“Windows DNA” in that it refers to many 
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things: a business strategy, development 
model, marketing pitch, development plat- 
form, and language run time. But where- 
as DNA is more of a “best practice” ap- 
proach to development, at the heart of 
.NET is the Common Language Runtime 
(CLR), a set of operating-system services 
and an execution engine that provides 
run-time support for .NET programs. You 
can think of the Common Language Run- 
time as fulfilling the same role as the Java 
virtual machine. 

Programs written in C# are compiled into 
an intermediate language called “MSIL,” the 
equivalent to Java bytecode or Visual Ba- 
sic p-code. Any language that can be com- 
piled to MSIL can take advantage of the 
CLR features such as garbage collection, 
reflection, metadata, versioning, events, and 
security, to name a few. In addition, a class 
written in one language can actually inherit 
from a class written in another language 
and override its methods. 

Although this article is about C#, keep 
in mind that the class libraries and the 
CLR features are usable by any language 
that has an MSIL compiler. Initially, Mi- 
crosoft will provide MSIL compilers for 
C#, Visual Basic, JScript, and Managed 
C++. Third-party vendors have also de- 
veloped .NET compilers for a number of 
languages, including: Java (Rational), Eif- 
fel (interactive Software Engineering and 
Monash University), Perl (ActiveState), 
Python (ActiveState), Scheme (North- 
western University), Smalltalk (Quasar 
Knowledge Systems), Cobol (Fujitsu), 
Component Pascal (Queensland Univer- 
sity of Technology), APL (Dyalog), Stan- 
dard ML (Microsoft Research-Cambridge), 
Mercury (University of Melbourne), and 
Oberon (ETH Zentrum). 

It’s interesting that Microsoft is pushing 
cross-language development while Sun/Java 
pushes cross-platform development. 

However, both approaches have their 
share of problems. Writing components 
in multiple languages always entails some 
interoperability problems. Moreover, there 
is always the problem of what to do when 
your Scheme programmer moves on to 
greener pastures. Cross-platform devel- 
opment has also never been flawless, as 
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(continued from page 74) 
Java programmers well know, especially 
in the areas of GUIs and threading. 


C# Similarities to Java 

In addition to sharing a number of fea- 
tures, most of the keywords in Java have 
their C# counterpart. Some keywords are 
identical; for example, new, bool, this, 
break, static, class, throw, virtual, and 
null. This is expected since these key- 
words are derived from C++. Interesting- 
ly, many keywords in Java that do not have 
direct C++ equivalents such as super, im- 
port, package, synchronized, and final, have 
different names in C# (base, using, name- 
space, lock, and sealed, respectively). Table 
1 summarizes some of the keywords and 
features that exist in both languages, but 
look slightly different in C+. 


The Object Class 
Another good example of cosmetic dif- 
ferences is the System.Object class in C#, 
which has the exact same methods as the 
java.lang.Object class in Java except they 
are spelled differently. The clone method 
in Java is called MemberwiseClone in C#, 
Java equals is Equals in C#, finalize is Fi- 
nalize, getClass is getType, hashCode is 
GetHashCode, and toString is ToString. 
Mere coincidence? Well, according to An- 
ders Hejlsberg in a recent interview, “C# is 
not a Java clone” (“Deep Inside C#: An In- 
terview with Microsoft Chief Architect An- 
ders Hejlsberg,” by John Osborn, http:// 
windows.oreilly.com/news/hejlsberg_ 
0800.html). Right, it’s a MemberwiseClone. 


Access Modifiers 
C# specifies access modifiers inline as part 
of the member definition just like Java 
does, instead of in a block like C++. The 
modifiers public and private have the ex- 
act same meanings in all three languages. 
However, “protected access” in Java is 
called “protected internal” in C#, and 
“package access” in Java is called “inter- 
nal” in C#. C#’s protected modifier gives 
access to any subclass, even if it is not in 
the same program. Another difference is 
that package access is the default for Java 
while private is the default for C#. 








Exceptions 

Similar to Java, try blocks in C# support 
the finally clause. There is no throws 
clause in C#, so effectively, all exceptions 
are unchecked. You aren’t forced to han- 
dle any exceptions. The opinion at Mi- 
crosoft, according to one employee, is that 
forcing developers to handle exceptions 
does more harm than good. This leads to 
many catch(Exception e) exception han- 
dlers that don’t do anything useful. 

This is an unfortunate decision because 
the throws clause makes it obvious which 
exceptions can be thrown by a method 
and are part of the method’s contract. Oth- 
erwise, you are forced to read the called 
method’s source code to know which ex- 
ceptions can be thrown. 


C# Improvements Over Java 

Since C# and Java look and act alike in 
many ways, why bother using C# at all? 
As you might expect from a successor 
language, C# makes improvements over 
Java in some areas. In addition to adding 
innovative features, C# has a simplified 
syntax for things such as iteration, events, 
and treating primitive types like objects, 
which reduces the amount of code you 
need to write. 


Reflection, Metadata, 

And Custom Attributes 

Java and C# compilers both emit metada- 
ta with the class bytecodes to support re- 
flection. Reflection provides the ability to 
obtain type information dynamically and 
makes it possible for the run-time system 
to automatically provide implementations 
for run-time type identification, dynamic 
method invocation, serialization, marshal- 
ing, and scripting support. 

Microsoft has extended the notion of 
metadata with Custom Attributes, which 
let you markup a class, method, method 
parameter, and just about anything else, 
with extra information that can be ac- 
cessed at run time. When compiled, the 
attributes are combined with the EXE/DLL 
itself, so it is not possible for them to be 
out of sync with the code. Attributes elim- 
inate the need to maintain separate IDL 
files and type libraries. The C# compiler 





Example 1: Hello World. (a) in Java; (b) in C#. 
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supports attributes for interoperating with 
legacy COM objects (Internet time indeed!) 
and the Win32 APIs, object serialization, 
conditional code execution, and depre- 
cating program entities. 

An example of where attributes are use- 
ful is specifying the XML schema to use 
when serializing a class (see Listing One). 
As you can see, the class is marked-up 
directly with bracketed attributes to indi- 
cate what XML node and attribute names 
to use when serializing or deserializing an 
Album object. Listing Two is a program 
that creates an A/bum and serializes it. 
Listing Three is the resulting XML file. (The 
command line for getting the XML serial- 
ization example to compile using the pre- 
release C# compiler is csc.exe /r:Sys- 
tem.Xml.dll /r:System.dll /r:System.Xml 
Serialization.dll Albums.cs.) 


Versioning 
Versioning is a feature that is much need- 
ed in the Windows and Java development 
space. Windows developers are accus- 
tomed to “DLL Hell,” where mismatched 
versions of DLLs can cause all kinds of 
application problems. Java developers are 
also familiar with deprecated APIs and in- 
compatible versions of serialized objects. 
.NET promises to fix these problems by 
letting you specify version dependencies 
between components and by supporting 
side-by-side execution of multiple ver- 
sions of a component. This is similar to 
the versioning capabilities specified in the 
Java Product Versioning Specification, ex- 
cept that compile-time, installation time, 
and run-time enforcement is built-in. 


Assertions 

Assertions are useful for sanity checking 
and enforcing the preconditions, post- 
conditions, and invariants of Design By 
Contract. They often catch bugs introduced 
by faulty design/logic, incorrect assump- 
tions, integration, and code maintenance. 
Assertions are not supported in Java, al- 
though the Java Community Process is 
working on it. Listing Four demonstrates 
assertions in C#, and Figure 1 shows the 
assertion dialog that appears. It lists the 
filename and line number where the as- 
sertion occurred, the general and detailed 
error messages, and the stack trace. 


ref and out Parameters 

You can pass parameters by reference or 
specify that they are output parameters by 
using the ref or out modifiers. Since Java 
only allows pass-by-value, you have to 
perform silly tricks to get the same effect, 
such as using return values, passing in a 
one-element array, or putting objects in- 
side wrapper classes. An example of pass- 
by-reference is: void swap(ref long n1, ref 
long n2);. 
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Virtual Methods 

By default, all methods in Java are virtu- 
al and can be overridden by a derived 
class. In C#, as in C++, methods must be 
explicitly declared virtual. A common er- 
ror in Java and C++ when overriding meth- 
ods is when someone inadvertently mod- 
ifies the signature of the base class 
method. Because the signatures don’t 
match, the derived class method hides in- 
stead of overriding the base class method. 
C# turns this into a compile-time error by 
requiring that the derived class use the 
override keyword to override a virtual 
method. In addition, a compile-time warn- 
ing is given if a method in the derived 
class hides a method in the base class. In 
this case, you can use the new keyword 
to remove the warning. 


enums 

Java does not implement C/C++-style 
enums because its designers claim that 
enums are not object oriented. Face it, 
enums are a lot more typesafe than Java 
static final int constants. enums are not 
only present in C#, they are typesafe, 
can have the ++, -, <, and > operators 
applied to them, and can be converted 
to and from a string. 


decimal Data Types 

C# decimal data types are 128 bits and 
have a greater precision and smaller range 
than floating-point types. They are par- 


ticularly useful for —- apo 


Inheritance 


Assertion ail » 





Figure 1: The assertion dialog. 


switch Statements 

By default, case labels in a C# switch state- 
ment do not fall through to the next case 
label. In addition to being able to speci- 
fy an integer in the switch and case state- 
ments, you can also specify a string. 


C# Syntactic Sugar 

A method is just a function in which the 
first argument is a pointer to the object. In- 
stead of calling foo(object,x,y), C++/Java/C# 
allow you to write object.foo(x,y). This is 
an example of how syntactic sugar makes 
object-oriented development easier. C# 
provides some sugar of its own to make 
component-oriented development easier. 


Delegates and Events 

Delegates are a significant innovation for 
C#. They are object-oriented function 
pointers that can reference static or in- 
stance methods. They provide a type-safe 


mechanism for implementing callback 


. ements |FooBar i 


Referring to the 
base class 


| 
. super. hashCode): 


functions and events. C# supports single- 
cast, multicast, synchronous, and asyn- 
chronous delegates. 

Java provides for events by using the Java- 
Beans event model and adapter classes. 
Event handling in C# is simpler and only 
requires you to implement individual meth- 
ods instead of entire interfaces. Dele- 
gates were first introduced as a non- 
standard feature of Visual J++, much to 
the dismay of Sun Microsystems, which 
derided delegates as being “not object 
oriented.” For Sun’s delegates argument, 
see http://www.javasoft.com/docs/white/ 
delegates.html, and http://msdn.microsoft 
.com/Vvisualj/technical/articles/delegates/ 
truth.asp for Microsoft’s rebuttal. (Some 
even conjecture that the delegates issue 
was the turning point that lead to the Java 
lawsuit.) 

The event keyword introduces a dele- 
gate into your class that lets you fire 
events. Listing Five is a sample C# pro- 
gram that uses an event to notify a Stock- 
Tracker when the price of a stock changes. 


Value Types 

Value types in C# (long, int, char) can be 
treated just like reference types without 
requiring special wrapper classes as in 
Java Gava.lang.Long, for example). The 


import java.util; 
package MyStuff; 


Importing types 


Namespaces compiler implicitly converts value types 


into objects (and vice versa) on demand 
through a process called “boxing and un- 
boxing.” However, this incurs no overhead 
if the value type is never treated like an 
object. This allows C# developers to view 
the world as a unified type system in 
which all data types derive from object. 
Example 2 shows examples of how each 
language lets you treat primitive types as 
objects. 


— MyClass { 


final class Foo { } 
final static int MAX = = 
= * 
- ‘Convert string to 
- uppercase. 
* 


Prevent inheritance 
Constants 
Code comments 


. _ * /@param s The String to 
. convert _ 


Properties 

For our purposes, properties and fields 
refer to two different concepts. A field is 
a class data member (Jong size, for ex- 
ample), whereas a property is a pair of 
getter and setter access methods that pro- 
vide access to a field or calculate the need- 
ed values. Properties in C# are similar to 
JavaBeans properties in that access to them 
is controlled through access methods. 


Deprecated classes 

or methods  . 

Synchronization = syne rizedthis) { ( 
- i. +efs; 





Table 1; Cosmetic arene! between Java and C#. | 
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(continued from page 78) 

However, in C#, properties are directly 
supported by the language instead of re- 
lying on reflection and naming conven- 
tions, as in JavaBeans. This leads to a clean 
and concise syntax that makes data hid- 
ing a breeze. Example 3 illustrates how 
the Name property is implemented using 
the name field in both Java and C#. 

The advantages of the C# property style 
is that the getter/setter methods are locat- 
ed in the same block of code and have a 
more intuitive syntax because properties 
are accessed exactly like normal fields. 
JavaBeans properties also look like nor- 
mal fields when accessed from scripts or 
in a visual design tool, but not when ac- 
cessing them from Java code. 


foreach-Style Iteration Syntax 

The foreach statement in C# (borrowed 
from Visual Basic) lets you easily enu- 
merate over classes that support the Enu- 








merable interface, which includes arrays 





and collections. This eliminates the need 
to write for(int i=0; 1 < ary.length; ++) 
and the getlterator/hasNext/next triad as 
you have to in Java and C++. Listing Six 
demonstrates iteration in Java, while List- 
ing Seven demonstrates foreach-style it- 
eration in C#. 


String Formatting 

A great example of C# syntactic sugar is 
string formatting. Java provides the Mes- 
sageFormat class to allow printf-style for- 
matting. The syntax in C# is much cleaner 
because you can pass a variable number 
of parameters to the function. Example 4 
shows off the string formatting features and 
variable parameters of C#. Both examples 
output Error: File not found. (Code 2). 


Operator 

Overloading and Cast Operators 
Operator overloading allows you to rede- 
fine the semantics of operators (+, -, +=, 
==, and so on) for a given type. When used 


{ return n. 


ring 






(b) in C#. 


_ System.Console.Write 
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Example 3: Implementing the Name property using the name field: (a) in Java; 


Example 4: String formatting. (a) in Java; (b) in C#. 


value) 
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properly, operator overloading can provide 
an intuitive syntax for treating user-defined 
types like primitive types. C# provides the 
operator keyword for this purpose. 

In addition to operator overloading, C# 
lets you define cast operators. For exam- 
ple, a class called Fraction can provide a 
cast operator that converts a Fraction into 
a double. 


C# Performance Improvements 
Several C# features result in performance 
improvements over Java. These include: 


e Never interpreted. The advent of Just-in- 
Time (IT) compilers means that Java 
bytecodes are usually compiled directly 
into native code by the JVM instead of 
being interpreted. One difference with 
the Common Language Runtime is that 
the bytecode is never interpreted, it is al- 
ways JITed. Microsoft has a proven track 
record for writing the fastest JVM, so you 
can expect their MSIL JITer to be fast. 
Native platform support for Win32 and 
COM is an improvement over the per- 
formance of J/Direct and the Java Na- 
tive Interface. 
The stackalloc operation is similar to 
new, except that it allocates memory for 
an array on the stack instead of the 
heap. Stack allocation is much faster 
than heap allocation and is not subject 
to garbage collection. 
A struct data type is exactly like an ob- 
ject, except that structs are value types 
instead of reference types and do not 
support inheritance. They can be allo- 
cated on the stack just like primitive 
types and do not require the extra viable 
overhead. Use structs to represent sim- 
ple types, such as Point, Rect, Fraction, 
and so on. They lose their effectiveness 
when their size goes above 16 bytes. 

e Code in C# can be marked unsafe to al- 
low it to have direct memory access and 
prevent the garbage collector from in- 
terfering. 


What's Not to Like? 
Neither Java nor C# support generic types 
(templates). However, a proposal has been 
submitted to the Java Community Process, 
and Microsoft Research at Cambridge is 
said to be developing a solution for .NET. 
Like J/Direct, the P/Invoke class library 
provides the ability for C# to talk to na- 
tive Win32 APIs and DLLs. While this is 
a boon for interoperability, it requires 
writing C# function prototypes that have 
the same signatures as the API functions. 
For example, handles (HWND, for ex- 
ample) are approximated using a C# int 
and pointers are approximated with ref 
parameters. This technique is prone to 
signature mismatches that can cause un- 
predictable behavior and crashes. 
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(continued from page SO) 

Similar to P/Invoke, COM Interop pro- 
vides wrapper classes for allowing C# to 
use COM objects and vice versa. Unfor- 
tunately, certain types of COM interfaces 
are not 100-percent compatible with 
wrappers, which causes an impedance 
mismatch. 


Conclusion 

C# is Java with some nifty features, in- 
novations, syntactic sugar, and perfor- 
mance enhancements thrown in. C#’s el- 
egance, simplicity, and power promises 
to deliver Windows C++ developers from 


For Java programmers, C# picks up 
where Java leaves off by providing a 
high-performance, component- oriented 
language that integrates tightly with Win- 
dows. Let’s hope C# is as much fun as 
Java is to work with. 


the pits of boilerplate COM code and 


memory violations to the land of RAD. DDJ 





Listing One 


using System.NewXml; 
using System.Xml.Serialization; 
[XmlRoot ("album", Namespace="music") ] 
public class Album { 
[XmlElement ("artist") ] 
public string artist; 
[XmlElement ("title") ] 
public string title; 
[XmlArray("songs"), XmlArrayItem("song") ] 
public string[] songs; 


Fire_OnPriceIncreased != null) { 
Fire_OnPriceIncreased(name, value); // Inform listeners 
} else if (value < price && 


Fire_OnPriceDecreased != null) { 
Fire_OnPriceDecreased(name, value); // Inform listeners 
} 
price = value; // Update the price 
} 
} 
// DATA 


string name; 
long price; 


} public event PriceDecreasedDelegate Fire_OnPriceDecreased = null; 
public event PriceIncreasedDelegate Fire_OnPriceIncreased = null; 
} 
ist class StockTracker { 

Listing Two // Outputs a message when the stock price changes 

using System.Xml.Serialization; public StockTracker (Stock stock) { 

using System.I0; // Connect the Stock events to our event handlers 

class TestAlbum stock.Fire_OnPriceDecreased += 

{ new PriceDecreasedDelegate (OnPriceDecreased) ; 

public static void Main() { stock.Fire_OnPriceIncreased t= 
Album album = new Album(); new PriceIncreasedDelegate(OnPriceIncreased) ; 
album.artist = "Sasha"; } 
album.title = "Xpander"; 
album. songs = new string[5]; // Signature of event handler must match the 
album.songs[@] = "Xpander Edit"; // PriceDecreasedDelegate delegate declaration 
album.songs[1] = "Xpander"; private void OnPriceDecreased(string name, long val) { 
album.songs[2] = "Belfunk"; Console.WriteLine("Price of {@} dropped to {1}!", name, val); 
album.songs[3] = "Rabbitweed"; t 
album.songs[4] = "Baja"; private void OnPriceIncreased(string name, long val) { 
// Serialize the object to a file Console.WriteLine("Price of {@} rose to {1}!", name, val); 
FileStream fs = new FileStream("Album.xml", FileMode.Create) ; } 
XmlSerializer serializer = new XmlSerializer (typeof (Album) ) ; } 
serializer.Serialize(fs, album); class StockTester { 
} // Test the events 
} public static void Main() { 


Listing Three 


<?xml version="1.0"?> 


<album xmlns:xsi=http://www.w3.org/1999/XMLSchema-instance 


xmlns="music"> 
<artist>Sasha</artist> 
<title>Xpander</title> 
<songs> 
<song>Xpander Edit</song> 
<song>Xpander</song> 
<song>Belfunk</song> 
<song>Rabbitweed</song> 
<song>Baja</song> 
</songs> 
</album> 


Listing Four 


using System.Diagnostics; 
class ConfigFile { 
bool isFileOpen; 
public void Open(string strFile) { 
// Pre-conditions 


Debug.Assert(!isFileOpen, "Config file already open.", 


"You can only call Open() once."); 
Debug.Assert (strFile.Length > 9); 
isFileOpen = true; 


} 

public static void Main() { 
ConfigFile file = new ConfigFile(); 
file.Open("Joe.xml") ; 
file.Open("Joe.xml"); // Causes an assertion! 


Listing Five 


using System; 


delegate void PriceDecreasedDelegate(string name, long newPrice) ; 


// Called when the price drops 


delegate void PriceIncreasedDelegate(string name, long newPrice) ; 


// Called when the price increases 
class Stock { 
// Holds the price of a stock 
public Stock(string stockName, long startPrice) { 
name = stockName; 
price = startPrice; 


} 

public long Price { 
get { return price; } 
set { 
if (value > price && 


Stock ibm = new Stock("IBM", 100); 
StockTracker tracker = new StockTracker (ibm) ; 


// Update the stock price 


ibm.Price = 125; // Outputs "Price of IBM rose to 125!" 
ibm.Price = 98; // Outputs "Price of IBM dropped to 99!" 
} 
J 
e e e 
Listing Six 


import java.util.*; 
class Iterate { 
public static void main(String[] args) { 


// Enumerate command-line args array 
for(int i = 0; i < args.length; ++i) 
System.out.println(args[i]); 


// Create a linked list 

LinkedList list = new LinkedList(); 
list.add("Cube Farm") ; 
list.add("Sasha & Digweed") ; 


// Enumerate the list 

ListIterator it = list.listIterator(@) ; 

while (it.hasNext()) 
System.out.println(it.next()); 


Listing Seven 


using System.Collections; 
class Iterate { 
public static void Main(string[] args) { 


// Enumerate command-line args array 
foreach (string arg in args) 
System.Console.WriteLine(arg) ; 


// Create a linked list 

ObjectList list = new ObjectList(); 
list.Add("Cube Farm") ; 
list.Add("Sasha & Digweed") ; 


// Enumerate the list 


foreach (string str in list) 
System.Console.WriteLine(str) ; 


DDJ 
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Exploring Perl 


Libraries 





Viewing library 
module data 





Robert Kiesling 


uch of the time it takes to learn an 
object-oriented, GUI-centric lan- 
guage (with its overhead of event- 
driven and graphical objects) is 
spent learning about the class libraries. 
Perl is an exception. It trades off some of 
the facilities of more general-purpose lan- 
guages for speed and simplicity of inter- 
nal data. Still, you can get a reasonable pic- 
ture of an object’s member classes, methods, 
and globals, despite the apparent lack of a 
mechanism for viewing the data. 

The Perl API can include a hierarchy of 
object types and a well-defined API, but 
these can be traded off for the execution 
speed of lower-level data structures. This 
flexibility, almost paradoxically, gives you 
the ability to tinker with library and ap- 
plication code while it is executing. 

Even though the Perl libraries do not nec- 
essarily present an object-oriented API or 
code base, the language can still effective- 
ly implement or simulate a class-based en- 
vironment. The Perl library modules I pre- 
sent here (available electronically; see 
“Resource Center,” page 5), let you view li- 
brary module data within the Perl interpreter 
itself (using the Tk::Browser module) and 
look up the source code and documenta- 
tion for these modules; see Figure 1. 





Perl Library Structure 

In Perl, a module is simply a single file in 
Perl's library directories. A package is syn- 
onymous with a module, but its inheri- 
tance and directory hierarchies are quali- 


Robert is the maintainer of the Linux Fre- 
quently Asked Questions with Answers FAQ 
on the Internet. He can be contacted at 
rkiesling@mainmatter.com. 
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fied relative to the Perl interpreter’s search 
path. A package is declared at the begin- 
ning of most source modules using the 
package keyword: package Lib::Module;. 
This declaration identifies the package file’s 
path name, relative to the Perl interpreter’s 
@INC search path. More significantly, how- 
ever, it tells the interpreter to create and 
use an additional symbol table hash (a 
“stash”) that bears the package’s name. 
Like all Perl hash tables, a stash contains 
a number of key/value pairs, which may 
refer to any of Perl’s recognized data types 
as well as file handles and other stashes. 





The stashes follow the naming conven- 
tion for packages: the hierarchical name of 
the module with the elements separated by 
double colons (::). The interpreter maintains 
each module’s stash separately, identified 
by the package name with double colons 
appended to it. The package, Lib::Module, 
for example, uses the stash Lib::Module::. 
Each stash can be accessed from the call- 
ing package by using the typeglob reference 
to the package handle. In addition, the de- 
fault stash, or main::, contains references 
to all of the other stashes that are in effect. 

In Perl terms, each hash reference is a 
typeglob, which refers to one of the stan- 
dard data types. A reference to a client 
stash would simply be another hash; see 
Listing One(a). The default stash effec- 
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tively provides a class membership to a 
package even if the package doesn’t ex- 
plicitly declare one; contains data acces- 
sible to all library modules; and keeps 
track of which modules have been load- 
ed at run time. The syntax of stash type- 
globs is somewhat like that of a file-handle 
reference, but the stashes are still hash 
objects. Using the exists operator, you can 
test for the presence of a variable name 
in a given stash, as in Listing One(b). 

When actually looking up the elements 
of a symbol table, they are dereferenced 
through a typegiob that refers to the sym- 
bol table entry. An entry often consists of 
more than one data type: An array like 
that in Listing One(c), for example, con- 
tains data of type SCALAR, which typically 
will be the array’s name, as well as data 
of type ARRAY. It is necessary to escape 
the colons with backslashes so that the in- 
terpreter does not treat them as operators. 
Also, the entry is treated as a SCALAR 
when simply looking up the variable 
name, as a HASH when retrieving the en- 
try’s data reference, and as an ARRAY 
when retrieving the data itself. 

Symbol table typeglobs can have up to 
eight key values for the standard data types 
SCALAR, ARRAY, and HASH; see Listing 
One(d). A stash entry can also refer to the 
following: CODE, the interpreted bytecode 
of the program itself; IO, a file handle; 
PACKAGE, the name of the package that 
is using the data; the symbol’s NAME; and 
FORMAT, a reference to the routine that 
formats the data. If not used, the values 
are undefined, with the exception of 
SCALAR, which is always present, even if 
it is simply an empty string. 

Perl defines the type of data lexically; 
stronger type checking does not occur un- 
less you’re writing to the data. But this 
can also contribute to confusion when 
performing multiple dereferences, ac- 
cessing variables with similar names, and 
resolving language syntax ambiguities. 


Static Library Declarations 


The standard Exporter.pm library module 
lets one module’s routines refer to another 


http://www.ddj.com 





You can test and analyze your site 
iualcolelo]atolulmmual—-mel-\-1(e)e)aa(-1al al ©) Keay 
PN iTosiiate incax-laa seco meolealaslelaliecltomrcit-le 


(Oo) | F-lelole-husmsaat-lar-lee-Valemeclar-teu 





problems sooner. So you can deliver your 
it-Melaidlasl-varolaue-) ol-cormr-lale molaMelUlole(—im 
BY-Yor- [Uomo [-1adlale Mi mel0lmr-]aleme(-vadlalemit 
right shouldn't be mutually exclusive. 


Tahagte[6(-Yo Wan @x-1| msistsry at-\ohVAl-\ WA © anol are 


free evaluation, visit www.radview.com. 






Y=) 0) Mef-(e| 


Resource IM ENAE-Te(-1 









| ne . a Dd Ll EW” 




















(continued from page 84) 

module’s data without relying on the run- 
time binding of data and subroutines to a 
particular module. Exporter.pm provides 
an interface for exporting variables and 
functions to another module; see Listing 
Two(a). The receiving module, Listing 
Two(b), can then specify which symbols 
it needs from the original module. Vari- 
ables and subroutines named in the @EX- 
PORT array appear in the receiving mod- 
ule’s symbol table as if they were declared 
in that module. The symbol names in 
@EXPORT_OK get exported only if the 
calling module requests them. 

When a Perl module processes a use 
<modulename> statement, it calls a sub- 
routine named import, which is defined by 
default in Exporter.pm. The difference be- 


#include <stdio.h> 


int quotient(int *q, 


{ 


if(*p) return *q/*p 


else return *q; 


} 


int main() 
{ 
int n = 
int da = 
DEINCE ( 
return 0; 


} 


20, m= 4; 


quotient( &n, 
"%*d/%d == %d\n", n, m, 


tween the statements require <module- 
name> and use <modulename> is that the 
require statement imports all of the sym- 
bols of its argument into the calling mod- 
ule’s symbol table namespace. All of the 
subroutine and variable names act as if they 
were defined in the calling module. This 
can lead to duplicate variable and subrou- 
tine names unless care is taken to export 
only those symbols that a module needs. 
However, you can use a variable or 
function of another module by calling it 
statically, as in Listing Two(c). This call- 
ing convention creates another symbol 
table hash for ExtraModule::, if one does 
not already exist. The Perl interpreter then 
looks for the readfile() subroutine in 
ExtraModule’s namespace, not in the 
namespace of the calling module. 


for C/C++ 
]j Bug of the Month 
lint Prod 


int *p) 


/* compute ratio */ ; 


&m ); 
aq ); 
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The disadvantage of importing data 
statically is that the Perl interpreter can 
completely bypass the Exporter.pm rou- 
tines, which do version and name check- 
ing. It is possible to import data from 
another similarly named module, sub- 
routine, or variable without the inter- 
preter being aware of it. 

Static symbol declarations can interfere 
with the use of the SUPER:: keyword in 
programs, which use method calls and su- 
perclass data. Any subroutine or variable 
imported in this manner must be fully 
qualified. The Perl interpreter makes no 
attempt to look up the name in any of the 
existing symbol table hashes except the 
one specified, or in %main:: if no pack- 
age name qualification is made. This is 
one way a package can override the in- 
terpreter’s data-hiding mechanism. 

In fact, in Perl terminology, a client 
module’s symbol table is referred to as the 
“INNER namespace,” relative to the call- 
ing module’s, which is referred to con- 
ceptually as the “OUTER symbol table.” 
Thus, a fully qualified data reference may 
have the name syntax of *OUTER:.:IN- 
NER: :symbolname. 

The conceptual view of Perl’s module 
evaluation is that the client namespace of 
one package can be contained within the 
namespace of another, even though Perl’s 
import mechanisms go to some pains to 
make each module believe it has its own 
stash. A call to a client package does not 
need to return, and the client package 
does not have a view of the calling pack- 
age’s stash, nor does the client package 
know how it was loaded. 

The use statement checks for the mod- 
ule’s availability when the program is first 
byte-compiled and the interpreter resolves 
external data references. Perl’s AutoLoader 
and DynaLoader modules let references be 
resolved without actually loading the mod- 
ule. The success of a static symbol-name 
lookup depends on the called module’s 
availability when the main program is ex- 
ecuted. This can easily cause misleading re- 
sults when examining the main program’s 
stash, as the reference handles are lexical- 
ly resolved. 

Similarly, the scope of a variable de- 
clared with my extends only to the sub- 
routine, loop, or conditional construct in 
which it is declared. The scope is lexical 
in nature, and the variable is not, under 
normal circumstances, visible in the stash 
of a client subroutine. 


The require Statement and Scoping 

Instead of importing data references, the 
require statement evaluates the called mod- 
ule immediately so that a complete, sepa- 
rate symbol table hash is created in the 
main:: stash. This is the method by which 
modules such as Tk::Browser.pm and 
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Devel::Symdump.pm can view unrelated 
modules’ stashes. Instead of generating pri- 
vate variables, the entire namespace of the 
evaluated module is created as a separate 
environment, and compile-time pragmas 
may be determined independently of the 
calling module’s run-time environment. 

The use of my to declare variables re- 
laxes a “use strict vars” pragma. Other- 
wise, variables must be declared or stati- 
cally qualified. A typeglob declaration that 
refers to a symbol table hash must be de- 
clared as local, so that its scope extends 
to a subsequently created stash; see List- 
ing Three(a). The eval "require package" 
method of symbol-table creation is the same 
as that used by the standard base.pm mod- 
ule, which imports and evaluates modules 
when they are byte compiled, as with the 
use base <packagename>,; statement. 

To create a complete stash context, the 
interpreter must know which stash you’re 
interested in. You state this with the pack- 
age keyword, then you declare the pack- 
age space with a use < package > keyword. 
In many programs, a use statement is suf- 
ficient, because Perl’s AutoLoader mod- 
ule can load functions on demand. How- 
ever, because you want to view the entire 
stash without waiting for a program to ex- 
ecute, you must load the entire package 
with the require keyword. 

When used together, the three statements 
provide a method to simulate a switch to a 


Figure 1: Browser window. 
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different, possibly unrelated, package; see 
Listing Three(b). When examining a sym- 
bol table space with a 7k::Browser, these 
statements must be called in the Tk::Bows- 
er.pm module, so that the stash of the client 
module is visible to the 7k::Browser stash. 
Conversely, when examining a symbol table, 
all that is necessary to avoid confusing one 
stash context with another is to match the 
stash’s handle: 


if( $key =~ /VERSION/) && ($package =~ 
/New::Package/) { 


Classes and Object References 

The basic mechanisms for creating objects 
in Perl are the bless() and tie() functions. 
Each creates an association between an ob- 
ject reference (almost always a hash vari- 
able) and a module package. Once an ob- 
ject is blessed, any references to the object 
or its hash keys is referred to the package. 
Objects are most commonly blessed when 
they are constructed. Perl constructors, like 
those in other other object-oriented lan- 
guages, are commonly named “new,” al- 
though a package is free to follow its own 
naming conventions. In Listing Four(a), the 
constructor is called as a method. It returns 
the object, which is now registered with 
Perl, as a reference to a Lib::Module object. 
In effect, bless creates the new variable in 
the called package’s symbol table, in- 
stead of the calling package’s. The calling 
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package receives a reference to the new- 
ly created object, but it refers only to the 
called package’s newly created variable. 

The main program does not need to ex- 
plicitly name the object’s class. The refer- 
ence to the object knows which package it 
belongs to and the program will call the 
appropriate function as an object method; 
see Listing Four(b). While the symbol table 
value of *m{PACKAGE} will be Tk::Brows- 
er, the value of ref( $m) will be the name 
of its member package— in this example, 
Lib::Module. Of course, you still need to 
specify the package that contains the con- 
structor routine, where the bless function 
registers package membership. In contrast, 
using an unblessed reference, you still could 
call a function statically, as in Listing Four(c). 

Another method of static module load- 
ing is via the require statement, which 
loads a package’s symbols into the call- 
ing program’s namespace. 

The tie() function provides a set of func- 
tion references (for example, READ, STORE, 
and PRINT) in the object’s package. They 
provide a facility for user-defined functions 
of common assignment and retrieval func- 


tions. The exact functions a data type rec- 
ognizes are described in the perltie(1) 
manual page (http://www.frognet.net/ 
help/manpages/docs/perltie.html). 


Inheriting Multiply 

There are two main mechanisms for spec- 
ifying a package’s class membership: the 
well-documented @/SA array and the less 
well-known use base statement. The first 
specifies how the Perl interpreter should 
go about looking up symbols. The use base 
statement is invoked when the program is 
byte compiled, and reads another module’s 
code into the symbol space before the main 
module’s code is interpreted. (This is sim- 
ilar to the require statement.) 

If you had a Tk module that exported 
symbols as well as derived some of its func- 
tions from the main widget class, you would 
need to specify both of its superclasses: 


package Tk::NewWidget; 
@ISA=qw(Exporter Tk:: Widget); 


This adds references to the Exporter:: and 
Tk:: Widget:: stashes to the main stash. In 
effect, it provides a search path to look up 


symbol names from either module, even if 
the module does not implement an object 
class of its own. Perl does not require an 
object hierarchy to import modules, but 
may import symbol names statically. 

In practice, Perl class hierarchies tend 
to be only one or two levels deep. The 
language resolves inconsistencies in ex- 
ternal symbol name resolution by pro- 
viding a UNIVERSAL abstract superclass 
that provides packages with default mech- 
anisms for class membership and name res- 
olution; namely, the can and isa functions. 
Either of these may be called as a class 
method or statically. The client module need 
not declare a class membership at all. 

Unlike other object-oriented languages, 
Perl does not enforce an inheritance con- 
struct. Because even relatively low-level 
features of the language are available to 
a program script, each module can use 
the data suited to the application, pro- 
vided that the programmer is aware of the 
potential issues and difficulties that this 
flexible approach can incur. 


DDJ 





Listing One 
(a) 


*main::MainWindow:: => 
{ InitBindings => 
*MainWindow: :InitBindings 
viewable => 
*MainWindow: : viewable 
::_configure => 
*MainWindow: :_configure 
*etc% 


} 


if( exists ${*I0\:\:File\:\:}{VERSION} ) { 
my( S$version_name, $module_version ) = 
%{*IO\:\:File\:\:} {VERSION} ; 
} 


(c) 


my (Sarray_key, Sarray_val) = %{*Module\:\:array}; 
my Sarray_name = ${*{Sarray_val} {SCALAR}}; 
my Sarray_contents = @{*{S$array_val} {ARRAY}}; 


foreach ( my (Skey, $val) = %{*Module\:\:} ) { 
local (*entry) = $val; 
if( defined *entry{ARRAY}) { 
foreach( @{*entry{ARRAY}} ) {print "$_, "; } 


if( defined *entry{HASH}) { 
foreach( my (Skey_1, $val_1) = %{*entry{HASH}}) { 
print "Skey_1=>$val_1, "; 


} 

if( defined *entry{SCALAR}) { 
print ${*entry{SCALAR}}."\n"; 

} 


Listing Two 


(a) 


package extraModule; 
require Exporter; 
@EXPORT=qw(readData) ; 
@EXPORT_OK=qw(VERSION ISA) ; 
sub readData{ 

. program code_... 
} 


(b) 


package mainModule; 
use extraModule qw(VERSION ISA readData) ; 


(c) 


my Sextra_data = ExtraModule::readfile( $filename ); 


Listing Three 
(a) 


ny *keylist; 


# Create a new namespace for package Spkg. 
unless( exists ${"Spkg\:\:"} {VERSION} ) { 


eval "package Spkg"; 
eval "use Spkg"; 
eval "require Spkg"; 


} 
while( my ($key, $val) = each %{*{"Spke\:\:"}} ) { 


if( defined $val ) { 


local (*v) = $val; 


# test for pkg to make sure we get the right stash. 
) 


if( ($val =~ /Spkg 


&& (Sval =~ /VERSION/ ) ) { 


$m -> {version} = ${*v{SCALAR}}; 


} 

if( ($val =~ /Spkg/) && ($val =~ /ISA/ ) ) { 
$m -> {superclasses} = "@{*v{ARRAY}}"; 
} 


Skeylist{$key} = $val; 
} 


} 


(b) 


eval "package Lib::Module"; 


eval "use Lib::Module"; 


eval "require Lib::Module"; 


Listing Four 


(a) 


package Lib: :Module; 
sub new { 
my Sproto = shift; 


my $class = ref( S$proto ) |; $proto; 


my Sself = { 

children => [], 
parents => '', 
pathname => '', 
basename => '', 
packagename => '', 
version => '', 
superclasses => undef, 
baseclass => '' }; 
bless( Sself, Sclass); 
return $self; 


} 
package Tk: :Browser; 
use Lib: :Module; 


my $m = new Lib: :Module; 


$m -> module_info( S$packname ); 


(c) 


package Tk::Browser; 


my $m = &Lib::Module::module_info( $packname ); 
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Java and the 





Waba Toolkit 





Java in the palm 
of your hand 





Al Williams 


f you own a PalmPilot or a Windows 

CE-based handheld, chances are you’ve 

thought of programs you'd like to write 

for it. Unfortunately, programming these 
small machines can turn into a big job. 
However, if you know Java and have or- 
dinary Java development tools, you can 
develop a wide range of programs that 
run on the Palm or Windows CE with a 
development kit known as “Waba” (http:// 
www.wabasoft.com/). Waba is a freely 
available software kit released under a li- 
cense similar to Linux. It provides a sub- 
set of Java that will run on either hand- 
held device (CE or Palm), or any machine 
that runs Java—you can even embed a 
Waba applet in a web page. In this arti- 
cle, Pll show how you can simply write a 
Waba koverse Polish Notation (RPN) cal- 
culator that will run on a handheld de- 
vice. The complete source for the calcu- 
lator is available electronically; see 
“Resource Center,” page 5. 

Unlike some tools, Waba doesn’t include 
a compiler, debugger, or much else in the 
way of tools. Instead, it uses your exist- 
ing Java development tools. Of course, 
you can download the basic JDK from Sun 


Al teaches courses on various web-related 
technologies through Wintellect (http:// 
www.wintellect.com/). He can be con- 
tacted at alw@al-williams.com. 
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for free, but you might find an IDE from 
one of the major tool vendors more to 
your liking. Waba doesn’t care which tools 
you use. 

To run a Waba program on a hand- 
held device, you need a small download 
(40-60 KB depending on platform). This 





Waba run time installs on the handheld 
and executes the Waba programs you 
create. The Waba developer’s kit is in a 
ZIP file that is around 360 KB—very lean 
compared to most development tools. 
This is especially true if you consider a 
great deal of the development kit is doc- 
umentation and examples. You can also 
get the Waba source code in another 160 
KB download. Of course, you don’t re- 
ally need the source code unless you 
like to tinker. If you are using a non- 
Windows PC, you'll also need to down- 
load the binary tools for your platform 
(you can download them for Linux, So- 
laris, BeOS, MacOS, AmigaOS, and 
OS/2). There is also a third-party set of 
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tools written in Java that should run prac- 
tically anywhere. 


The Basic Idea 
Writing a Waba program is much like writ- 
ing a Java program. However, Waba pro- 
vides a complete class hierarchy that re- 
places the standard Java classes. Many of 
these classes mimic the normal classes (for 
example, String or Graphics). Others are 
unique to Waba. For example, Main Win- 
dow is similar (but different) from the usu- 
al Java Applet object. 

Waba provides the following packages: 


e waba.lang. Object, String, and String- 
Buffer classes. 

e waba.sys. Classes for time, date, type 
conversion, and access to machine re- 
sources. 

e waba.util. The Vector class. 

e waba.ui. Classes that provide user- 
interface elements (buttons and labels). 
It also provides support for timers and 
events (including pen events). 

e waba.io. Access to databases, files (the 
Palm has no files), serial port, and net- 
work sockets. 

e waba.fx. Classes for colors, fonts, im- 
ages, and sounds. 


You can also add some extra classes 
that are available from other authors, in- 
cluding several enhanced user-interface 
packages. 

Once you generate your class file, you'll 
use special tools to create a Waba exe- 
cutable that can run on the handheld de- 
vice. One tool (warp) creates special files 
tailored for the CE or Palm environment. 
The other tool (exegen) creates icons for 
both systems that can start the program. 
The exegen program also sets the 
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(continued from page 92) 
window size, memory allocation, icons, 
and other details. 


Anatomy of a Waba Program 

A Waba program typically extends Main- 
Window, the class that represents the 
Waba program and its primary window. 
From this object’s onStart function, you 
can create controls and perform other ini- 
tialization. The onEvent function is the 
main clearinghouse for events (keystrokes, 
button press, and pen events). 

Many useful Waba programs won't need 
more than a Main Window -derived class 
that contains an onStart and an onEvent 
function, along with whatever custom 
functions you need to do the actual work. 


A Waba Example 

As a long-time user of HP calculators, I’ve 
often wanted a simple Windows CE RPN 
calculator. Why not use Waba to make a 
calculator that will run on CE or the Palm? 
You’d even be able to put the calculator in 
a web page or use it on any Java-enabled 
platform. Figure 1 and Listing One present 
the complete calculator. 

When you design a Waba program, you 
must be aware of the limitations of Waba 
when compared to ordinary Java. For ex- 
ample, it is important to realize that Waba 


does not support the double data type. 
That means the calculator will use the float 
data type (and accept the resulting loss of 
precision). Initially, you might consider 
reading numbers as a string and then con- 
verting the string into a float. This would 
be easy to do with Java, but Waba han- 
dles conversions a bit differently. 

The waba.sys.Convert object has many 
static functions (for example, toString) that 
converts basic types to strings and vice 
versa. However, the object lacks a func- 
tion that converts a string to a float. There- 
fore, the calculator program has to pro- 
vide code to convert key presses into a 
floating-point number. Of course, the in- 
herent lack of precision in the float class 
means that entering, for example, 1.2345 
might actually display 1.2344999. 

Another problem with the lack of dou- 
ble support is constants. When you write 
a floating-point constant (ike 3.141) in 
Java, it has an implicit type of double. 
When the Waba interpreter tries to load 
the constant, it will generate an invalid op 
code error. The answer to this problem is 
to use the f suffix on constants to force 
their type to float (as in 3.141f). 


Inside the Calculator 
In Listing One, notice that the alcalc class 
begins with a few simple member vari- 


ables. Instead of using a separate variable 
for each button, the calculator uses an ar- 
ray. A label control (Ved ) serves as the cal- 
culator’s display. 

Another array (stack) serves as the cal- 
culator’s stack (four deep in this exam- 
ple). There are a few other simple vari- 
ables to track input state and the stack’s 
current status. The final two variables in 
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Figure 1: A Waba-based calculator 
running under Windows CE. 
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(continued from page 94) 
this section, wid and hi, store the dimen- 
sions for normal-sized buttons. 

Since the calculator’s stack is circular, 
the addsp function handles incrementing 
and decrementing the stack pointer (sp). 
The function simply wraps the stack point- 
er around when it falls past zero or in- 
creases beyond the end of the stack. 

To simplify the user-interface setup, the 
setBin function handles the population of 
the button array. The calling code sup- 
plies the row and column number for the 
button as well as the button’s caption and 
relative width. A width of one is the nor- 
mal width, but some buttons occupy twice 
the space (a width of two). The size and 
actual location of the buttons depend on 
the host device’s screen size. The onStart 
routine begins by finding the screen size 
and setting the wid and hi variables to ap- 
propriate values. Then it is a simple mat- 
ter to call setBtn to populate the calcula- 
tor’s face with buttons. 

Most of the real work occurs in the on- 
Event function. This is where the calcula- 
tor processes events that Waba generates. 
The first order of business in this function 
is to detect KEY_PRESS events. These 
events occur when users enter characters 
via the keyboard or a pen input method 
dike Jot or Graffiti). 

To keep the code simple, the calcula- 
tor effectively converts key events into a 
corresponding button index. In the case 
of a real button push (a PRESSED event), 
the code searches the button array and 
determines the index. Either way, the re- 
maining code handles a button index, not 
the actual event. 

Like most event-handling code, on- 
Event’s main section uses a switch state- 
ment. Variables control the state of the in- 
put, as follows: 


e /ift. This variable is 1 if there is a new 
number being entered. 

e dp. If 0, users are entering the whole 
part of a number. If dp is not 0, it indi- 
cates the scale of the fractional input. 
For example, if dp is 100, this indicates 
the second digit beyond the decimal 
point. 





Listing One 


// RPN calculator for CE or Palm 
// Al Williams 


import waba.ui.*; 
import waba.fx.*; 
import waba.util.*; 


public class alcalce extends MainWindow 


final int stacksize=4; // Set stack depth 


// Buttons and display 
Button btns[] = new Button[18]; 
Label led = new Label("%.%",Label.RIGHT) ; 
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e neg. The algorithm used to build the 
number assumes that the input number 
is positive. This variable remembers if 
the number is negative so it can restore 
the correct sign if necessary. 


All the numbers use the same routine. 
Each function key (the “+” key, for 


You can even embed 
a Waba applet in a 
web page 





example) has a small piece of separate 
code that performs the correct operation. 


Debugging Waba 

One of the nice features of Waba is that 
you can get your program operating on 
your workstation using Java. Simply com- 
pile your program with the javac program 
as usual to produce class files. To run the 
alcalc program, run: 


java waba.applet.Applet alcalc 


When you are running on the work- 
station you can even use System.out.print- 
In to display messages on the console. 
Remember, the library you are using is not 
exactly the Waba library in this case. For 
example, the standard run time doesn’t 
support the serial port. The Waba Extras 
package does have a replacement library 
that supports the serial port, but I had 
trouble with a large program like alcalc 
receiving key events when using the Ex- 
tras package (a simple test program 
worked, though). 


// This is the stack 


However, you can get most, if not all, 
of your program working before you both- 
er downloading to the handheld comput- 
er. If you've used any calls to printin, you'll 
want to remove them before you do a fi- 
nal build. 


Building the Calculator 

You can’t download class files directly 
to the handheld device. Instead, you 
must run the warp utility to generate a 
wrp file (for Windows CE) and a pdb file 
for the Palm. In addition to these files, 
you'll also want to run exegen to create 
icons for both platforms (either an Ink 
file or a prc file). You can wrap these 
commands up in a batch file (see List- 
ing Two). 

The Waba documentation explains how 
to install files on each platform. For Win- 
dows CE, it is simply a matter of copying 
files to the correct locations. For the Palm, 
you'll run the Palm’s installation program. 
Either way, you'll wind up with an icon 
that will run the program. Of course, you'll 
also have to install the Waba run time once 
on the handheld machine. 


What's not in Waba? 

Waba does not currently support long and 
double data types. It also does not sup- 
port exceptions or threads. However, for 
many applications this is not a serious 
problem. Of course, the source code for 
Waba is available, so if you are particu- 
larly intrepid, you could add support for 
what you wanted yourself. 

For example, if you tried to add more 
scientific functions to the calculator, you’d 
find that Waba doesn’t support many more 
math functions. However, someone has 
written a replacement math library (see 
the resources). There’s no reason you 
couldn’t do the same. 
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float stack[] = new float[stacksize]; 


float dp=0.0f; 


int sp=0; 


// stack pointer 


// decimal point scaling 


int lift=0; // stack lift (new entry) flag 


// Scale factors for the current window 


int hi; 
int wid; 
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// Set incr==1 to increment; -1 to decrement 
// circularizes based on stacksize 
int addsp(int sp,int incr) 
{ 
if (incr==-1 && sp==@) return stacksize-1; 
if (incr==1 && sp==stacksize-1) return 9; 
return sptincr; 


// set up a button 
// x,y are row and column NOT pixels 
// width is 1 for single-width, 2 for double width 
void setBtn(int i,String lbl,int x,int y, int width) 
{ 
btns[i]=new Button(1b1l) ; 
btns [i] .setRect (x*(widt5) ,y*(hit5) ,width*wid,hi) ; 
add(btns[i]); 


} 
// Set up all the work 
public void onStart() 


Rect r; 
r=getRect(); 


hi=r.height/7; 
wid=r.width/5; 
stack[sp]=0; 

// display 
led.setRect(0,0,r.width-1@,hi) ; 
add(led) ; 

// set up all buttons 
setBtn(0,"0",1,4,1); 
setBtn(1,"1",0,3,1); 
setBtn(2,"2",1,3,1); 
getBtn(3 , "3",2,31)% 
setBtn(4,"4",0,2,1); 
setBtn(5,"5",1,2,1); 
setBtn(6,"6",2,2,1); 
setBtn(7,"7",@,1,1); 
setBtn(8,"8",1,1,1); 
setBtn(9,"9",2,1,1); 
setBtn(10,"Enter",@,5,2); 
setBtn(11,"+",3,1,1); 
setBtn(12,"-",3,2,1); 
setBtn(13,"*",3,3,1); 
setBtn(14,"/",3,4,1); 
setBtn(15,"+/-",0,4,1); 
setBtn(16,".",2,4,1); 
setBtn(17,"x<>y",2,5,2); 


// display number at top of stack 
void disp(int newlift) 
{ 
led.setText (waba.sys.Convert.toString(stack[sp])); 
lift=newlift; 


// Handle key pressses 
public void onEvent (Event event) 
{ 
int i=-1; 
int leaddp=0; // leading decimal point? 
int neg=1; // 1 means positive number 
if (event.type==KeyEvent .KEY_PRESS) 
{ 


KeyEvent kevent = (KeyEvent) event; 

if (kevent.key==IKeys.ENTER) kevent.key='\n'; // Enter key 
// convert key to button # 

String keys="0123456789\nt-*/ ."; 

for (i=0;i<keys.length() ;i++) 

if (keys.charAt(i)==kevent.key) break; 

if (i==keys.length()) return; 

} 


else if (event.type == ControlEvent.PRESSED) // button 
{ 
for (i=@;btns[i]!=event.target;it+); // scan for button 
} 
if (i!=-1) 
{ 
switch (i) 


case 16: // decimal point 
leaddp=1; 
if (dp==0.0f) dp=10.0f; 
if (lift==0) break; // if not first entry, nothing special 


leaddp=0; // ignore lead dp flag if old number 


if (stack[sp]<@) // get absolute value of tos 
{ 


neg=-1; 
stack[sp]=-stack[sp] ; 
} 


if (dp==0.0f) // whole number 
{ 


stack[sp]=stack[sp] *10+i; 
} 

else // fractional number 
{ 
stack[sp]=stack[sp]+i/dp; 
dp*=19; 


// if number was negative, make it negative again 
stack [sp]=stack[sp] *neg; 
disp (®); 
if (leaddp==1) dp=10.0f; 

break; 

// enter key 

case 19: 
i=addsp(sp,1); 
stack [i]=stack[sp]; 
sp=i; 

lift=1; 
break; 
/] * 
case ll: 
i=sp; 
sp=addsp(sp,-1); 
stack[sp]=stack[sp]+stack [i] ; 
disp(1); 
break; 

case 12: 
i=sp; 
sp=addsp(sp,-1); 
stack [sp]=stack[sp]-stack [i]; 
digp( 2): 
break; 

// * 

case 13: 
i=sp; 
sp=addsp(sp,-1); 
stack [sp]=stack[sp] *stack [i] ; 
disp(1); 
break; 

// / 

case 14: 
i=sp; 
sp=addsp(sp,-1); 
stack[sp]=stack[sp]/stack [i] ; 
disp(1); 
break; 

{/ +/- 

case 15: 

stack[sp]=-stack[sp]; 
disp (lift) ; 
break; 

// xOY 

case 17: 
float tmp; 
i=addsp(sp,-1); 
tmp=stack[sp] ; 
stack[sp]=stack [i]; 
stack [i]=tmp; 
disp(1); 
break; 

} 
} 


Listing Two 
@echo off 


rem compile program, build warp files and launchers 
javac *.java 

warp c /q alcale *.class 

exegen /q /i icon.bmp alcale alcalc alcalc 

echo To test program: 

echo java waba.applet.Applet alcalc 


i=0; // otherwise, treat like a @ key 
// fall into ® case 
case @: 
case 1: 
case 2 
case 3 
case 4 
case 5: 
case 6: 
case 7: 
case 8 DDJ 
case 9: // digits 

if (lift==1) // new number? 

{ 

sp=addsp(sp,1); 

stack [sp]=0; 

lift=0; 

dp=0.0f; 

} 

else 
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An efficient scheduler 
can make the difference 





John Pote 


ardware limitations on a small data 
logger I was designing meant that 
three interrupt sources were fed into 
the microcontroller via one interrupt 
pin. The microcontroller at issue was the 
Texas Instruments MSP430C337, which 
has outstandingly low-power consump- 
tion. But experience revealed that even 
with the maximum clock input of 1.8 MHz 
at 3.3V supply, it was not terribly quick. 
One of the interrupt sources was a UART 
requiring reasonably fast servicing, the 
second was in response to changes in the 
modem control lines of the associated 
RS232 port. The third was the inter- 
rupt/busy line from an ATA PC card con- 
troller. The modem control lines had less 
priority than the UART, while the PC card 
interrupt had no timing requirements at 
all and could simply wait until the pro- 
cessor time had time to deal with it. 

Because there are three sources on one 
pin, the initial interrupt code has to re- 
spond fast enough to determine the in- 
terrupt source, clear that source so other 
interrupts can get through, then jump to 
the relevant handler. There is one handler 
for each interrupt source. At some early 
stage, at least before entry to the handler, 
global interrupts need enabling to permit 
other interrupts. Since the handlers may 
be slow, a second interrupt might occur 
before the first has terminated. Thus, ide- 
ally, everything should be reentrant. 

A single high-priority hardware inter- 
rupt needs to trigger a number of varying 
and lower priority handlers. A set of ad 
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hoc flags and appropriate tests at the end 
of the processor interrupt pin handler and 
interrupt source handlers could no doubt 
deal with the situation. However, should 
additions to the system be needed, then 
all the handlers would need to be modi- 
fied. This was felt to be a source of po- 
tential problems— it would be so easy to 
forget to make appropriate changes to all 
the handlers. Clearly, what I needed was 
an efficient scheduler. 


The Scheduler 
The scheduler I present here was devised 
to meet the needs of the system just 








es ca 


described and allow simple extension if 
more handlers are needed or the priori- 
ties change. A hardware interrupt routine 
determines the interrupt sources and logs 
requests against the appropriate handlers. 
The scheduler is then called. It calls the 
requested handlers in prioritized order un- 
til no requests are pending when it returns 
to the original background routine. Op- 
eration of the scheduler is such that a han- 
dler may also issue handler requests, 
which are then dealt with in the priori- 
tized order as usual. The scheduler runs 
with interrupts enabled and is fully reen- 
trant but will not cause reentry to a han- 
dler. This simplifies the handlers because 
they do not need to be reentrant, mini- 
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mizing their stack use. This is useful for 
an MSP430C337 as stack space is restrict- 
ed— only 1-KB SRAM is available. Figure 
1 shows how interrupts cause the various 
bits of code to run for (in this example) 
the UART interrupt and handler and the 
lower priority modem control line inter- 
rupt and handler. When the modem in- 
terrupt occurs, a second instance of the 
scheduler starts, and finds the UART han- 
dler running, so it immediately returns to 
the interrupted UART handler. Upon com- 
pletion, the UART handler returns to the 
scheduler (first instance), which finds a 
pending modem handler request, so it 
runs the modem handler. 


Priority List 

The scheduler is essentially a short routine 
and a linked list. Example 1 is the struc- 
ture of each list member. Individual han- 
dler routines are pointed to by each list 
member and, along with this pointer, are 
a request count, RUNNING flag, and a 
pointer to the next list member (held as a 
byte offset to conserve memory). This list 
is processed from head to tail so the high- 
est priority member is at the head and low- 
est at the tail. A dummy member at the list 
tail has its RUNNING flag permanently set. 
This forces termination of the scheduler. 


Scheduler Algorithm 

Example 2 is pseudocode for the sched- 
uler. First, an interrupt occurs and the hard- 
ware interrupt routine runs. This accesses 
the external hardware, clears the interrupt 
source, and increments the request count 
in the relevant handler member in the 
linked list. Interrupts are enabled, and the 
interrupt routine jumps to the scheduler. 
Eventually, the scheduler will terminate by 
executing a refi (“return from interrupt”) 
instruction and the original background 
routine resumes. Starting at the head and 
working towards the tail, the list is exam- 
ined. If the request count for a particular 
member is not zero, then that handler is 
called. When the handler returns, the re- 
quest count is decremented and the pro- 
cess repeated. When the request count 


http:/www.ddj.com 











ae wait is over. 
With National Instruments Measurement 
Studio", you can quickly and easily create 
measurement and automation applications 
with your choice of programming 
languages. Measurement Studio 
provides a complete set of fully 
integrated measurement tools for 
Microsoft Visual Basic, Visual 
C++, and National Instruments 
LabWindows"/CVI. 


Measurement Studio delivers 
standardized, reusable, integrated 
measurement tools you need to 
acquire, analyze, and present your 
data faster than ever before. 


e Instrumentation controls — 
GPIB, Serial, VXI! 

e Data acquisition — analog I/O, 
digital 1/O, counter/timer 

e Visualization tools — 2D and 3D 
graphs, knobs, switchers, sliders 

e Measurement analysis — digital filtering, 
statistics, frequency-domain analysis 





Viste nkeony/into/mstudia 
10 request Vary: FREE 
lavalttaiion (fo 





NATIONAL 
INSTRUMENTS 





ni.com/info/mstudio 
Tel: (512) 794-0100 ¢ Fax: (512) 683-9300 ¢ info@ni.com 


© Copyright 2000 National Instruments Corporation. All rights reserved. Product and company names listed are trademarks or trade names of their respective companies. 








(continued from page 104) 

reaches 0, the next member of the list is 
selected and the process is repeated. Thus, 
each handler is called once for each re- 
quest and in the priority order. 


The RUNNING Flag 

Basic operation is modified by the RUN- 
NING flag. As soon as the scheduler begins 
to process a list member, its RUNNING flag 
is set. It is cleared on completion of that it- 
eration. If it is found to be set, then the cur- 
rent scheduler instance knows that a prior 
instance of the scheduler is already running 
or about to run that handler. In this case, 
this newer instance simply terminates, which 
allows the prior instance to process the han- 
dler request counts. However, an interrupt 
may occur when the scheduler is still ex- 
amining a list member but at a point where 
the RUNNING flag is clear. In this case, the 
new instance of the scheduler finds the flag 
clear, so it runs the handler (and, in fact, all 
handlers with requests in the priority list). 
On returning to the first instance, it finds 
all the requests dealt with (that is, request 
counts are 0), so it does not run any han- 
dlers as it continues to scan down the list 
and finally terminate. 

The final member in the list is a dum- 
my member with its RUNNING flag set. 
There will never be a scheduler instance 
that set this particular flag, so it will nev- 
er be cleared. Hence, it provides a per- 
manent end stop to the list because it will 
always cause the scheduler to terminate. 


Code for the Scheduler 

Listing One is the MSP430 code for the 
scheduler. Masks for the request count and 
the RUNNING flag are held in registers la- 
beled rReqCntBits and rRunning. This 
makes the code faster than using immedi- 
ate mode each time the masks are need- 
ed, because immediate mode constants 
need an extra access cycle and cannot be 
used for the second operand of an in- 
struction. The register 7ListPtr points to the 
current list member. As the scheduler is ef- 
fectively the end of an interrupt routine, it 
is left to the interrupt routine to save the 


scheduler’s working registers. The interrupt 
routine can then use these registers rather 
than saving and restoring its own working 
registers — and a few more cycles are 
saved. The scheduler runs with interrupts 
enabled. Careful testing has shown that it 
may be interrupted without problems at 
any point. The new interrupt may issue 
handler requests and jump to the sched- 
uler as usual, creating a second instance 
of the scheduler. This second instance will 
run all handlers with request counts greater 
than 0 and RUNNING flags clear. If the first 


Figure 1: 
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How interrupts cause the various bits of code to run. 


instance is already running a handler, then 
the RUNNING flag will be set and the sec- 
ond instance will terminate and return con- 
trol to the first instance. If the second in- 
stance does not find the RUNNING flags 
set, then it will run all the handlers with 
request counts greater than 0 before ter- 
minating. The first instance then continues, 
but will find all request counts 0 and, thus, 
do nothing more. 
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Listing One 


RUNNING .equ 1 
REQCOUNT equ Z 
REQCNTBITS .equ Ofeh 


;RUNNING flag bit mask 
;i.e. not the running bit 


;rListPtr, rRunning, rReqCntBits are 
: macro names for general-purpose registers. 
scheduler: 

mov.w &schedListFirst, rListPtr 
mov.w #RUNNING, rRunning 
mov.w #REQCNTBITS, rReqCntBits 
;is current function already running? 
bit.b @rListPtr, rRunning 
jnz schedEnd 


doElem: bis.b 


;request count inc/dec value 


rRunning, SchedListElem. reqCnt (rListPtr) 


call 
sub.b 
zCount: 
bic.b 
bit.b 
jnz doElem 


rRunning, SchedListElem.reqCnt(rListPtr) 
@rListPtr, rReqCntBits 


SchedListElem. funcPt (rListPtr) 
#REQCOUNT, SchedListElem.reqCnt (rListPtr) 


;dec req count 


;clr this func running 
;req cnt > @? 


;req count is 0, do next func 


mov.b 


;point to list head 
; the next 2 are useful constants to have in registers 


SchedListElem.indxNxt(rListPtr), rTemp 
add rTemp, rListPtr 


;->next element in list 


;repeat first instrs here for speed 


;is new function already running? 


bit.b @rListPtr, rRunning 
jz doElem 

schedEnd: 
popSchedRegs 


reti 


;mark this func running 


bit.b @rListPtr, rReqCntBits 
jz zCount 
;call the func 
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;req cnt > 0? 
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;macro to restore scheduler registers 
;terminate scheduler and interrupt 
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can work—if you 


follow the rules 





David Janik 


common problem faced by pro- 
gramming teams is the ability to pro- 
duce consistent, reliable, and main- 
tainable modules. One solution to 
this is to develop a set of coding conven- 
tion guidelines that are adhered to by all 
team members. Typical coding conven- 
tions deal with syntax issues at the low- 
est level. However, the module guidelines 
I present in this article deal with module 
issues at the level above the syntax— that 
is, at a level of abstraction above the spe- 
cific source code. These guidelines are 
therefore intuitive and maintainable, and 
promote reuse. Ultimately, development 
teams that follow guidelines such as these 
produce modules that are cohesive, de- 
coupled, and encapsulated. For instance, 
we've used these guidelines in producing 
team- developed firmware for a 12-channel 
DS3/STS1 card of a digital cross-connect 
system. The card consists of several ASIC 
devices and is controlled by an internal 
LAN connection. In truth, the guidelines 
are generic for any C language module, in- 
cluding nonreal-time scenarios. However, 
the example queue and interrupt routines 
presented here show how you can apply 
the guidelines to real-time systems. 

The module convention is modeled after 
the class data type from object-oriented pro- 
gramming languages. There are three sec- 
tions to this convention: 


David is an engineer for Tellabs and can 
be contacted at david janik@tellabs.com. 
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e Protected data, the module-visible data 
structure. The visibility is only within 
the module. This does not include the 
local variables used within a function. 
External functions —‘“methods,” “ser- 
vices,” or “callable functions” other mod- 
ules will invoke. External functions are 
the API for the other modules in the sys- 
tem to use. External functions can in- 
voke their own internal functions, ac- 
cess their own protected data, and 
invoke the external functions of other 
modules. 





e Internal functions, the “private” func- 
tions of a module. Internal functions 
have access to their own protected data 
and can invoke another module’s ex- 
ternal functions. 


Figure 1 is a simple model of module 
ABC showing the relationship between 
the abc_ProtectedData, abc_Internal- 
Functions, and abcExternalFunctions. It 
also shows the calling relationship be- 
tween modules ABC and XYZ. Module 
ABC's internal or external functions can 
call only external functions of module 
XYZ. This module model convention fa- 
cilitates data encapsulation. Module co- 
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Module Design Guidelines 
for Real-Time Systems 


hesion and decoupling characteristics are 
realized from the software architecture. 

Figure 1 also illustrates the simple 
naming convention. External functions 
have the module name as a prefix 
(xyzExternalFunction(), for instance), 
while internal functions and protected 
data have the module name prefix and 
the underscore character (xyz_internal- 
Function() and xyz_protectedData, re- 
spectively). The intent of this naming 
convention is to intuitively identify ex- 
actly where the function or data resides 
when reading the source code. Conse- 
quently, there is no need to grep or 
cscope for a function. 


Module Model Source Files 
There are three types of source file struc- 
tures associated to this module model: 


e Simple. All the source code for exter- 
nal functions, internal functions, in- 
ternal task function, and protected data 
structure reside in a single file, such 
as Listing One(a). The prototypes of 
the external functions and any #de- 
fines, enums, and the like, needed by 
the external functions, reside in a file 
like Listing One(b). The application- 
specific header file, Listing One(c), is 
the last file and defines any application- 
specific information the module 
needs to work for the particular ap- 
plication. 

Example 1 is an example of where 
the module source files would reside in 
a typical source directory. The applica- 
tion directory contains four subdirecto- 
ries. For this application, three modules 
are required. 

e Divided. The difference between sim- 
ple and divided file structures is that the 
external functions, internal functions, 
and protected data are separated into 
different files. All the source code for 
the external functions reside in one file, 
like that in Listing Two(a). The internal 
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External Functions 
ex: abcExternalFunction() 


Internal Functions 
ex: abc_internalFunction() 


Protected Data 
ex: int abc_protected Data() 


Module 
XYZ 
External Functions 


ex: xyzExternalFunction() 


Internal Functions 
ex: xyz_internalFunction() 


Protected Data 
ex: int xyz_protected Data() 





Figure 1: Simple example (modules ABC, XYZ source files). The red arrows 
represent function calls and the black arrows are protected data access. 
(Unheritance and polymorphism are not available in this model.) 
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(continued from page 108) 
functions reside in another file, such as 
Listing Two(b). The internal function 
prototypes and any other #defines, 
enums, and the like, needed to use the 
internal functions reside ina file like 
Listing Two(c). The protected data struc- 
ture resides in a file such as Listing 
Two(d), while the prototypes of the ex- 
ternal functions (and any #defines, 
enums, and so on) reside in a file like 
Listing Two(e). Finally, the application- 
specific header file is the same as the 
simple file model; see Listing Two(f). 
Example 2 illustrates where the module 
source files would reside in a typical 
source directory for divided file structures. 

e Distributed. The distributed file struc- 
ture further divides the external func- 
tions into related files; see Listings 
Three(a) and Three(b). Also, the internal 
functions are divided into related files, as 
in Listings Three(d) and Three (f) (ex: 
abc_task.c, abc_utilities.c). The internal 
function prototypes and any other #de- 
fines, enums, and so on, needed to use 
the internal functions reside in the files, 
as in Listings Three(e) and Three(g) 
(abc_task.h, abc_utilities.h). The Protect- 
ed data structure resides in another file, 
as in Listing Three(h) (abc_map.h). The 
prototypes of the external functions and 
any #defines, enums, and the like, re- 
side in the file (abc.h) see Listing 
Three(c). The application-specific head- 
er file is the same as the SIMPLE file 
model (abc_appSpec.h), see Listing 
Three). Example 3 shows the file struc- 
ture for distributed modules. 


Listing Three illustrates the simple file 
naming convention. The external func- 
tions have the module name as a prefix 
(abcMessage.c). Internal functions and pro- 
tected data have the module name prefix 
and the underscore letter (abc_task.c, 
abc_map.h). The intent of the XXX_app- 
Spec.h file is to group all the application- 
specific parameters into one file. This 
makes the module easier to reuse by only 
modifying one source file. Also note that 
there is no nesting of header files. 


Module Types: Passive and Active 
The two types of modules are passive and 
active. A passive module type is one 





Figure 2: Queues, events, and 
mailboxes. 
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where there are no real-time operating 
system (RTOS) tasks running. It is a simple 
function call and return. A passive mod- 
ule can use a semaphore to control ac- 
cess to its protected data. 

An active module, on the other hand, 
is a module type that has at least one 
RTOS task created. The active module may 
be driving data into passive modules pe- 
riodically or waiting for a message to be 
put onto its queue: 


e Tasks. A task is a function with an infi- 
nite loop. There are two basic types of 
tasks — periodic and asynchronous. The 
periodic task has an associated RTOS 
timer that wakes at a defined frequency. 
The asynchronous task waits on a queue, 
event, mailbox, and so on. The asyn- 
chronous task wakes when a message is 
posted to the queue, event, mailbox, and 
the like. Typically, another task or inter- 
rupt routine will post a message. When 
the task runs, it will read the queue, 
event, mailbox, or whatever, and process 
the message. The task function is an in- 
ternal function for the module. 

e Queues, events, and mailboxes. A mes- 
sage processing task needs a queue so 
other tasks can post a message to it. Some 
system designs have the queue as pub- 
lic or global and is visible to all modules. 
The structure of the message also needs 
to be global to all modules that post a 
message to the queue; see Figure 2. 


For the module model presented here, 
the queue is a protected data structure and 
not visible to the other modules. The oth- 
er modules will call external functions of 


a module and these functions will post a 


message to the task queue; see Figure 3. 

Modules ABC and XYZ do not have ac- 
cess to the queue pdq_queue since it is a 
protected data structure of module PDQ. 
Module ABC calls the external function 
pdgService1(), which puts a message onto 
the queue pdg_queue. Also, module XYZ 
calls external function pdgqService2(), 
which puts a different message onto the 
queue pdq_queue. 

The advantages of this type of design 
are cleaner interface to the PDQ module, 
maintainability, the format and message 
content for the queue may change with- 
out affecting modules ABC or XYZ, and 
data encapsulation (the only functions 
posting messages to the PDQ queue are 
within the PDQ module). 


Interrupt Service Routines 

The Interrupt Service Routine CSR) can 
be internal or external functions, de- 
pending on how the hardware is de- 
signed. If the hardware has an interrupt 
controller device and all the interrupts 
go through this device, then all of the 


http://www.ddj.com 


APPLICATION 
-include 


/* #include "pdq.h" 


to use services of module PDQ */ 
-abc_appSpec.h 
fe 


-xyz.c /* #include "pdq.h" to use services of module PDQ */ 
-xyz_appSpec.h 

-PDQ 
-pdq.c 
-pdq_appSpec.h 


Example 1: Where module source files might reside in a typical simple source 
directory. 


Tntroducing Multi*dit 9, the neseme 
1 to work with all the languages | 
Sonu rneyereo(eravem \@ieameatem arco)iicorve(oyeme) amcatem Colcaueceartecemearcmpe Ware menmmnunllerteces 


editor designec 


now used to develop Veen sence lematalce Merelemcrahusceratnatcelanecespemvualceemcomuntcenre 


maintain all of vour code. Multi-Edit 9 has many new features that will tmproy 
(Oeics (Com Oho ae Orbe rer mucleleel es coe 
Sott/ Export iii 
Www.sott export.com pt 
Info softexport.com 
International Inquiries: +553 1 2 
Germany: O1S0 860341 

. OQSOOQ-905823 

OSOQ 973098 


Improved File Compare With Merging ° 
Enhanced Embedded Language Handling : 
Language Support For: - 

PHP. XML, FCL/TK, REBOL, HIMLscnp 
Xbase++. Python and Numerous Others 


American Cybernetics Ine. 
Sales: (SOO) 899-0100 
480) 968-1945 


(480) 966-1654 


AWeyreres SOE. 
PANS Re Pee a age re Ree a 


sales multicdit.com ates. OF 


Dr. Dobb’s Journal, February 2001 








‘ ] > | +*~\1 +4 ¢ 
vcs VOu use oe ORAS GAVE COI 


| 


| 


ce 


reyeore teCcrenwtawmveCe me cexe lecerem wovel axe ccauvel(o) prberernrereayelOmGnDETer 
Just a few of the new features: 

Visual Studio IDE Integration ° 

ColdFusion Studio Integration ° 

Enhanced HTML Support - 

COM Chient/Server Support ° 

Color Printing ° 

Improved User Interface - 


le 





111 











Module 
ABC 


External Functions 
ex: abcExternalFunction() 


Internal Functions 
ex: abc_internalFunction() 


Protected Data 
ex: int abc_protected Data() 


Module 
PDQ 
pdqService1() 
pdqService2() 


pdgq_messageProcTask() = 


RTOS_queue pdq_queue 





External Functions 
ex: xyzExternalFunction() 


Internal Functions 
ex: xyz_internalFunction() 


Protected Data 
ex: int xyz_protected Data() 


Figure 3: Message sending example (modules ABC, XYZ, PDQ source files). 


Device 
A 
Micro ees Device 
Processor Sh oS . B 
Device 
Device 
C 
Interrupt 
mere — 

if(DEVICE_A) 
{ 


/*module da*/ /*module db*/ /*module de*/ 


/*protected data*/ /*protected data*/ /*protected data*/ 
dalnterruptClear(); 
/*|Internal Functions*/ | | /*Internal Functions*/ || /*Internal Functions*/ 


if(DEVICE_B) 


{ 
dbinterruptClear(): 

/*External Functions*/ | | /*External Functions*/ || /*External Functions*/ 
if(DEVICE_C) dalnterruptClear() dbinterruptClear() dcinterruptClear() 


/*xclearDevice A*/ /*clearDevice B*/ /*clearDevice C*/ 


} } } 


{ 
deinterruptClear(): 


Figure 4: Interrupt controller example (modules A, B, C source files). 


Device 
A 
Micro Device 
Processor B 


Device 
CG 


/*module da*/ /xmodule db*/ /*module dc*/ 


/*protected data*/ /*protected data*/ /xprotected data*/ 


meee 


Vector 


/*internal Functions*/ N. /*|Internal Functions*/ 
db_InterruptClear() dc_InterruptClear() 


/*|Internal Functions*/ 
da_InterruptClear() 


/*clearDevice A*/ /*xclearDevice B*/ /*clearDevice C*/ 


} } } 


/*External Functions */ /* External Functions*/|| /*External Functions*/ 


Figure 5: No interrupt controller example (modules A, B, C source files). 
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APPLICATION 
--include 
-abce.h 
aye 
-pdq.h 
ABC 
-abe.c 
-abc_appSpec.h 
-abc_map.h 
-abc_internal.c 
-abec_internal.h 
“hye 
=ay2.c 
-xyz_appSpec.h 
DQ 


-pdq.c 
-pdq_appSpec.h 





Example 2: Where module source 


files would reside in a typical source 


directory for divided file structures. 


APPLICATION 
-include 
-abc.h 
neyc.h 
-pdq.h 
ABC 
-abcMessage.c 
-abcControl.c 
-abc_appSpec.h 
-abc_map.h 
-abe_task.c 
=abc task fi 
-abc_utilities.c 
-~abc_utilities.h 
eee 
=xy2.¢ 
-xyZ_appspec.h 
DQ 


—“pdd.c 
-pdq_appSpec.h 





Example 3: File structure for 
distributed modules. 


ISRs will be external functions— except 
the low-level driver module associated 
to the interrupt controller device. This 
will be an internal function that the in- 
terrupt vector points to. If the hardware 
has separate interrupt vectors for each 
device, then all of the ISRs will be in- 
ternal functions. Each interrupt vector 
points to the associated internal function 
ISR. Figures 4 and 5 illustrate the two 
models. 


Conclusion 

By following the guidelines for firmware 
development I’ve presented here, the mod- 
ules created will be more robust, main- 
tainable, and reusable because they will 
have the object-oriented properties of be- 
ing cohesive, decoupled, and data en- 
capsulated. 
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Listing One 
(a) 


/* Name: abc.c */ 
/* Description: */ 
/* Developer: */ 


#include abc.h 

#include abc_appSpec.h 

#include pdq.h /* prototype for pdqAction() */ 
#include rtos.h /* prototype for rtosStartTask() */ 


/*** Protected Data eK / 
typedef 
{ 


int mode; 
int *device; 
Jabe_struct 
abe_struct abc_data[ABC_INSTANCES] ; 


/*** Internal Functions Prototypes ***/ 
int abc_instanceCheck(int instance) ; 
int abc_messageCheck(int message) ; 


/*** Internal Functions ***/ 
int abc_instanceCheck(int instance) 


int result = ABC_ERROR; 
if (instance < ABC_INSTANCE) 
result = ABC_OK; 
are (result) ; 

[eK KK / 


int abc_messageCheck(int message) 


int result = ABC_ERROR; 
if (message == ABC_MESSAGE_1 ||; message == ABC_MESSAGE_2) 
{ 


result = ABC_OK; 
} 
return (result); 
/*** External Functions ***/ 
int abcInit() 
{ 


int i; 
for (i=@; i<ABC_INSTANCES; i++) 
{ 

abc_data[i] .mode 


Q; 
abc_data[i] .device (int *) (ABC_BASE_ADDRESS + (@x1@@ * i); 


} 
rtosStartTask(abc_task, ABC_PRIORITY) ; 
return (ABC_OK); 

} 

[*kK RK / 

int abcModeGet(int instance, int *mode) 


{ 
int result = ABC_ERROR; 
if (abc_instanceCheck(instance) == ABC_OK) 
{ 

*mode = abc_data[instance] .mode; 

if (*mode == 1) 

{ 
pdqAction(*abc_data[instance] .device) ; 
abc_data[instance].mode = 9; 

} 

result = ABC_OK; 

} 
return (result); 
} 
[xk ***/ 
int abcMessageSend(int instance, int message) 
{ 
int result = ABC_ERROR; 
if (abc_instanceCheck(instance) == ABC_OK) 
{ 

if (abc_messageCheck(message) == ABC_OK) 

{ 

*abc_data[instance] .device = message; 
result = ABC_OK; 

} 

} 

return (result); 
} 
/* Name: abc.h */ 
/* Description: «/ 
/* Developer: «/ 
#define ABC_ERROR 1 
#define ABC_OK 4) 


#define ABC_MESSAGE_1 al 
#define ABC_MESSAGE_2 2 


int abcInit (void) ; 


int abcModeGet (int instance, int *mode); 
int abcMessageSend(int instance, int message) ; 


(c) 


/* Name: abc_appSpec..h */ 
/* Description:  / 
/* Developer: */ 
#define ABC_INSTANCES 10 


#define ABC_BASE_ADDRESS  9x1239@ 
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#define ABC_IRQ 7 
define ABC_PRIORITY 5 
+define ABC_PAUSE 100 /* 1 second */ 


Listing Two 
(a) 


/* Name: abc.c + / 
/* Description: * / 
/* Developer: */ 


#include abc.h 

#include abc_appSpec.h 
#include abc_internal.h 
#include abc_map.h 
#include pdq.h 

#include rtos.h 


/* prototype for pdqAction() */ 
/* prototype for rtosStartTask() */ 


/*** Protected Data x / 
abce_struct abc_data[ABC_INSTANCES] ; /* declare Protected Data Structure */ 


/*** External Functions ***/ 
int abcInit() 


{ 
int i; 
for (i=@; i<ABC_INSTANCES: itt) 
{ 
abc_data[i].mode = 9; 
abc_data[i].device = (int *) (ABC_BASE_ADDRESS + (@x1@@ * i); 
} 
rtosStartTask(abc_task, ABC_PRIORITY) ; 
return (ABC_OK); 
} 


[*x*x  xxx/ 
int abcModeGet(int instance, int *mode) 
{ 
int result = ABC_ERROR; 
if (abc_instanceCheck(instance) == ABC_OK) 
{ 
*mode = abc_data[instance] .mode; 
if (*mode == 
{ 
pdqAction(*abc_data [instance] . device) ; 
abc_data[instance] .mode = @; 


} 

result = ABC_OK; 
} ; 
return (result); 


} 


[*x** ex / 
int abcMessageSend(int instance, int message) 
{ 
int result = ABC_ERROR; 
if (abc_instanceCheck(instance) == ABC_OK) 
{ 
if (abc_messageCheck(message) == ABC_OK) 
{ 
*abc_data[instance] .device = message; 
result = ABC_OK; 
} 
} 
return (result); 


(b) 


/* Name: abc_internal.c */ 
/* Description: */ 
/* Developer: */ 


#include abc_appSpec.h 
#include abc_internal.h 
#include abc_map.h 
#include rtos.h /* prototype for rtosPause() */ 
/*** Internal Functions ***/ 


int abc_instanceCheck(int instance) 


{ 
int result = ABC_ERROR; 
if (instance < ABC_INSTANCE) 
{ 
result = ABC_OK; 
} 
return (result); 
} 
[*** *RK/ 
int abc_messageCheck(int message) 
{ 
int result = ABC_ERROR; 
if (message == ABC_MESSAGE_1 |; message == ABC_MESSAGE_2) 
{ 
result = ABC_OK; 
} 
return (result); 
} 
[xxx xxx] 
int abc_task() 
{ 


int instance = @; 


(continued on page 114) 
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for(;;) 
{ 
rtosPause(ABC_PAUSE) ; 
if (*abc_data[instance] .device != @) 
{ 
abc_data[instance].mode = 1; 
instancet+; 
if (instance >= ABC_INSTANCE) 
{ 
instance = @; 
} 
} 
} 
(c) 
/* Name: abc_internal.h */ 
/* Description: */ 
/* Developer: */ 


/*** Internal Functions ***/ 

int abc_instanceCheck(int instance); 
int abc_messageCheck(int message) ; 
int abc_task(void); 


(d) 


/* Name: abc_map.h */ 
/* Description: */ 
/* Developer: */ 
/*** Protected Data ak / 
typedef 
{ 
int mode; 


int *device; 
Jabe_struct 


extern abc_struct abc_data[ABC_INSTANCES] ; 


(e) 


/* Name: abc.h * / 
/* Description: */ 
/* Developer: * / 


#define ABC_ERROR 1 
#define ABC_OK @ 


#define ABC_MESSAGE_1 1 
#define ABC_MESSAGE_2 2 


Configure Your 


Development Kit. 


Choose One (or more): 


Embedded Linux 
Windows CE 3.0° 
Windows NT Embedded* 
QNX® 

VxWorks® 

USSW Superfask® 

DOS 


COC 


Cw 


Choose One (or more): 


Q SBC—Geode 


Q ELAN104—NC 
QO AIM104—386 
a SBC-386 








int abcInit (void); 


int abcModeGet(int instance, int *mode); 
int abcMessageSend(int instance, int message) ; 


(f) 


/* Name: abc_appSpec..h 
/* Description: 
/* Developer: 


#define ABC_INSTANCES 
#define ABC_BASE_ ADDRESS 
#define ABC_IRQ 

#define ABC_PRIORITY 
#define ABC_PAUSE 


Listing Three 
(a) 


/* Name: abcControl.c 


/* Developer: 


#include abc.h 

#include abc_appSpec.h 
#include abc_utilities.h 
#include abc_map.h 
#include abc_task.h 
#include pdq.h 

#include rtos.h 


/*** Protected Data 


10 

0x123@ 

5 

5 

190 /* 1 second */ 


/* prototype for pdqAction() */ 
/* prototype for rtosStartTask() */ 


2 / 


abe_struct abc_data[ABC_INSTANCES] ; /* decl 


/*** External Functions ***/ 


int abcInit() 


int i; 


for (i=@; i<ABC_INSTANCES; i++) 
‘ 


abc_data[i] .mode 
abc_data[i] .device 


} 


are Protected Data Structure */ 


(int *) (ABC_BASE_ADDRESS + (@x190 * i); 


rtosStartTask(abc_task, ABC_PRIORITY) ; 


return (ABC_OK) ; 


[xxx  &*x/ 


Open it up. 
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Plug it in. 
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int abcModeGet(int instance, int *mode) 


int result = ABC_ERROR; 
if (abc_instanceCheck(instance) == ABC_OK) 
{ 


*mode = abc_data[instance] .mode; 
if (*mode == 1) 


pdqAction(*abc_data[instance] .device) ; 
abc_data [instance] .mode = Q@; 

} 

result = ABC_OK; 


} 

return (result); 
} 
/* Name: abcMessage.c */ 
/* Description: */ 
/* Developer: */ 


#include abc.h 

#include abc_appSpec.h 
#include abc_utilities.h 
#include abc_map.h 


/**k External Functions ***/ 
int abcMessageSend(int instance, int message) 


int result = ABC_ERROR; 
if (abc_instanceCheck(instance) == ABC_OK) 


if (abc_messageCheck(message) == ABC_OK) 
{ 
*abc_data[instance] .device = message; 
result = ABC_OK; 


} 

return (result); 
} 
(c) 
/* Name: abc.h */ 
/* Description: */ 
/* Developer: */ 
#define ABC_ERROR 1 
#define ABC_OK i) 


#define ABC_MESSAGE_i 1 
#define ABC_MESSAGE_2 2 


int abcInit (void) ; 
int abcModeGet(int instance, int *mode); 
int abcMessageSend(int instance, int message) ; 


(d) 


/* Name: abc_utilities.c */ 
/* Description: */ 
/* Developer: */ 


#include abc_utilities.h 
#include abc_appSpec.h 
#include abc_map.h 


/*** Internal Functions ***/ 
int abc_instanceCheck(int instance) 


int result = ABC_ERROR; 
if (instance < ABC_INSTANCE) 
{ 


result = ABC_OK; 


return (result); 


} 


[*eK KK / 
int abc_messageCheck(int message) 


{ 
int result = ABC_ERROR; 
if (message == ABC_MESSAGE_1 |; message == ABC_MESSAGE_2) 
{ 
result = ABC_OK; 


return (result); 


} 

(e) 

/* Name: abc_utilities.h + / 
/* Description: */ 
/* Developer: */ 


/*** Internal Functions ***/ 
int abc_instanceCheck(int instance); 
int abc_messageCheck(int message) ; 


(f) 


/* Name: abc_task.c */ 
/* Description: */ 
/* Developer: + / 


#include abc_task.h 

#include abc_appSpec.h 

#include abc_map.h 

#include rtos.h /* prototype for rtosPause() */ 
/*** Internal Functions ***/ 

int abc_task() 


hitp://www.ddj.com 


{ 


} 


(g) 


int instance = Q; 


for(;;) 
{ 
rtosPause(ABC_PAUSE) ; 
if (*abc_data[instance].device != Q) 
{ 
abc_data[instance] .mode = 1; 
instancett; 
if (instance >= ABC_INSTANCE) 
{ 


instance = @; 


} 


/* Name: abc_task.h */ 
/* Description: */ 
/* Developer: */ 
/*** Internal Functions ***/ 


int 


(h) 


abc_task(void) ; 


/* Name: abc_map.h * / 
/* Description: * / 
/* Developer: + / 


/** 
typ 
{ 


Jab 


ext 


i 


* Protected Data *x x / 
edef 


int mode; 
int *device; 


e_struct 


ern abc_struct abc_data[ABC_INSTANCES] ; 


/* Name: abc_appSpec..h */ 


/* Description: */ 

/* Developer: */ 

#define ABC_INSTANCES 16 

#define ABC_BASE_ADDRESS 0x1230 

+#define ABC_IRQ 7 

#define ABC_PRIORITY 5 

#define ABC_PAUSE 100 /* 1 second */ 
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Charlie Ma 


eing a Javascript programmer usual- 
ly means you have to adroitly han- 
dle various browser compatibility is- 
sues, then walk the fine line between 
coding to the lowest common denomina- 
tor browsers or else taking advantage of 
some of the newer features and risk alien- 
ating those using older browsers. All this 
adds up to convoluted code that’s diffi- 
cult— if not impossible — to maintain. 

Things are so bad that even our clients 
have learned to curb their expectations. 
Over the past year, I have not had a sin- 
gle client ask for a web site that works 
with all browsers. They don’t even ask for 
compatibility with most browsers. These 
days clients know to say “compatibility 
with 4.0 and above” or risk being laughed 
at by the very consultants they try to hire. 
And of course, by convention “4.0 and 
above” means only Internet Explorer and 
Netscape Navigator. 

Those patient enough to have negoti- 
ated the nettling nonconformance between 
Navigator (Nav) and Internet Explorer (IE) 
have found rewards in huge billing rates 
and endless job opportunities. Given that 
both Microsoft and Netscape released Ver- 
sion 4.0 browsers in 1997, Javascript pro- 
grammers have been getting fat for more 


Charlie is Director of Technology at Terra- 
scope, a web consulting firm. He regularly 
teaches Perl and Javascript in San Fran- 
cisco, California. He can be contacted at 
charlie@terrascope.com. 
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than three years. Sure IE 5 has been out 
for nearly two of those three years, but 
for the most part, it’s nicely backward 
compatible with IE 4 (to the point of hav- 
ing navigator.appVersion = 4.0), and 
you've probably stayed well away from 
the 5.0 features to maintain Navigator 4 
compatibility. 

All that’s about to change. Netscape 6 
is out, Which has navigator.appVersion = 
5.0, and it is not backward compatible. 


What’s more, this new browser has an 
Open-source cousin called “Mozilla.” Re- 
cent trends have taught us that open- 
source projects have a way of picking up 
steam, and there’s a good chance that 
within a couple of years, you'll find Mozil- 
la as the dominant browser technology on 
nonMicrosoft platforms. So, even though 
Microsoft holds nearly 85 percent of the 
marketshare with IE4 and IE5, you can- 
not afford to ignore the new Netscape and 
Mozilla browsers. 

This is good and bad news. Depend- 
ing on how you’ve managed your old 
clients, they’re either going to run back to 
you screaming for help, or scream bloody 
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murder for the broken “4.0 and above” 
promise. The real news is that for the first 
time, there seems to be an emerging stan- 
dard in client-side web scripting. Certain- 
ly the 5.0 features of IE 5 and Netscape 6 
are more alike than different. Both Mi- 
crosoft and Netscape have announced 
their resolution to support W3C’s DOM 
specifications. 

W3C expects to release three levels of 
DOM specifications. 


e Level 1 deals with the actual document 
object hierarchy and how to navigate it. 

¢ Level 2 will specify an event model and 
script interfaces to element styles. 

e Level 3 addresses document loading and 
saving, and a content model. 


At this writing, DOM Level 1 has been 
released, DOM Level 2 is at Candidate Rec- 
ommendation stage, and Level 3 is a work- 
ing draft. 

If all goes well, Javascript program- 
mers will soon be able to enjoy Java’s 
level of platform independence. But first 
you have to get past that transitional pe- 
riod in which there is still a large enough 
population of 4.0 browser users requir- 
ing you to continue this browser com- 
patibility dance. And if history is any in- 
dication, IE and Nav will maintain just 
enough subtle differences to keep you 
awake at night. 

For the remainder of the article, [’ll 
use the abbreviations in Table 1. This 
may cause some confusion as other de- 
velopers (including some listed in the 
“References”) refer to the new Netscape 
and Mozilla as “6.0 browsers.” I tend to 
stick to what’s indicated in navigator.app- 
Version, which would make both of 
these 5.0 browsers. The only exception 
is that I am calling Internet Explorer 5 a 
“5.0 browser” even though navigator.app- 
Version is 4.0. 
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(continued from page 120) 
Browser and Platform Detection 

In order to write scripts to handle all of 
these browser versions, you first need 
to be able to detect browsers. Listing 
One is a simple script that does just that. 
I use this script in all the following ex- 
amples via a script inclusion: <script 
src='listing _1.js'></script> 


EE 





___ Netscape Navi 
Navigator 4 — 
Navigator 6 — 


Table 1; Browser versions and abbreviations. 








Some History 

The Document Object Model (DOM) is the 
object model for the browser’s document 
object. This object is available to client-side 
Javascript and presents an interface to all 
the elements in your HTML document. Pri- 
or to 4.0 browsers, the DOM API was much 
the same across IE and Nav. You used it to 
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ray, or forms via the document.forms array, 
and the like. That, along with some famil- 
iar methods like document.write(), was 
about the limit of the earlier DOM. 

Things changed when the 4.0 browsers 
introduced Cascading Style Sheets (CSS) 
and layers. It is beyond the scope of this 
article to discuss CSS and layers in detail 
(see references for more information). 
Here, I will concentrate on absolutely po- 
sitioned CSS elements that in Nav4 occu- 
py their own layer. Developers use abso- 
lutely positioned CSS to place floating 
elements on the web page (see Exam- 
ple 1). The problem is that while each 
such element in IE4 is just an object in 
the DOM with some style attributes, each 
layer in Nav4 is like a mini web page 
with its own little DOM. This means that 
if you want to access the image object 
Img sitting in a CSS layer (which you’ve 
identified with id='L1'), you’d use doc- 
ument.images['Img1'\for IE4, but docu- 
ment.layers['L1'].document.images['Img1/] 
for Nav4. It’s easy to see that if you have 
layers within layers, you’d end up with 
scripts looking like: document.layers[lay- 
erlld!......layers[layerNId'!.document.im- 
ages['Img1/. 


The New DOM 

W3C’s Level 1 (DOM1) specification de- 
fines the object hierarchy and methods of 
navigating this hierarchy for the new 
DOM, which has been adopted by the 5.0 
browsers. 

DOM1 has a hierarchy of elements that 
are similar to that of Nav4’s layers, but they 
are more generalized and called “nodes.” 
Before launching into nodes, I need to 
distinguish between HTML “container 
tags” (which are all start and end tag pairs 
such as <form> </form>, <p></p>, <body> 
</body>, and <html></html>, which wrap 
open and close tags around some content) 
and “noncontainer tags” (used without close 
tags and do not wrap around any content; 
these include <input>, <img>, <br>, and 
so on). Every tag in an HTML document is 
a node in the new DOM. Each container 
tag is the parentNode of the tags it con- 
tains, which together constitutes its child- 
Nodes array. The root node is the docu- 
ment object itself, and the leaf nodes are 
the noncontainer tags. 

Unlike Nav4 layers, DOM1 offers some 
nice ways to walk through the DOM hi- 
erarchy as well as ways to access the 
nodes directly without having to step 
through the hierarchy at all. Table 2 lists 
some of these node properties and doc- 
ument methods. Using these interfaces, 
you can learn a lot about the DOM as im- 
plemented by 5.0 browsers. 

Listing Two is Javascript you can use to 
walk the hierarchy. The function that does 
the bulk of the work is getChildObjects(, 
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which takes a node as input (the other 
parameter is just to keep track of the in- 
dents for the display), and calls itself re- 
cursively to walk down, ultimately, to the 
leaf nodes beneath it. You can ignore the 
function prettyIndent(), which just makes 
everything come out nice and neat. The 
function showDOM() is the one you 
would actually call, which replaces your 
current document content with a large text 
area displaying the DOM hierarchy at the 
time sian ) was called. 
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Table 2: Useful properties and methods for navigating the DOM. 


Figure 1: The outputs of Example 2 produced by (a) IES; and (b) Nav5. 


Example 2 uses Listing Two’s scripts to 
display the DOM of a page similar to Ex- 
ample 1. Notice that I specified IDs for 
the script tags. This simply illustrates that 
you can place IDs in all HTML tags in 5.0 
browsers. This isn’t recommended in prac- 
tice, as earlier browsers may not be able 
to handle extraneous IDs in the tags. Fig- 
ure 1 shows the outputs of Example 2 pro- 
duced by IE5 and Nav5. 

Keep in mind that the DOM does not 
necessarily correlate to the source ie ~ 


a ha' id='txtaresa'> no< 
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HTML (as displayed by the browser 
menu’s view source). This can cause con- 
fusion. If you try to call showDOM() be- 
fore the document finished loading, the 
script breaks. This makes sense since the 
document object only exists after the doc- 
ument has finished loading. 

You can also try to invoke a script which 
alters the document content, then call show- 
DOMO) again. You'll then see the DOM in 
its new altered state (I'll do this in Exam- 
ple 3). This becomes a great debugging tool 





<html> 
<head> 
<style> 


</style> 

<script src='listing 1.js'></script> 
<script> S 

function init() { 


} . 
</script> 

</head> 

<body onload='init()'> 


</div> 
An example of a floating element. 


</body> 
</html> 








#L1 {position: absolute; left: 1@0px: top: 10@px:} 


ImgiObj = (isNav4)? document.layers|['L1'] .document.images['Imgi'] 


alert('Imagel sre is ' + ImgiObj.src); 


<div id='L1'><img name='Imgi' src='Imagel.gif'> 


<img name='Img2' src='Image2.gif' align='top'> 


Example 1: Using absolutely positioned CSS to place floating elements on a web page. 


as you can try calling showDOM( at vari- 
ous states of your document and see how 
the DOM is responding to your scripts. 
Another way to do this is to replace 
the newHTML =... and document.write... 
lines of the showDOM() function with 
alert(msg);, which will display the DOM 
diagram in an alert box. The alert box’s 
lack of a scroll bar makes this method 
impractical for all but the simplest DOM. 
But you can call showDOM(objld) where 
objid is the ID of an HTML element. This 












document.images|'Imgi']; 












returns only this node and the nodes be- 
neath it. 

In Figure 1, Nav5 and IE5 produced 
slightly different outputs. The most no- 
ticeable difference is the additional #text 
nodes peppered throughout the Nav5 
DOM. A #text node is a leaf node that 
contains no HTML tags (that is, it is straight 
text). Consequently, Nav5 is really doing 
the right thing as it seems to have placed 
almost all occurrences of the newline char- 
acter into a #text node, while IE5 seems 
to do so only haphazardly. In fact, except 
for the #fext node that appears after the 
first <IMG> node, IE5 has completely ig- 
nored all newline characters. This incon- 
sistency is important to keep in mind as 
it may affect the way you traverse the 
DOM hierarchy. 

Another IE5 weirdness is the occurrence 
of a node for a nonexistent <TITLE> tag. 
Also in IES, the <HTML> node does not 
return a parentNode;, that’s why its chil- 
dren aren’t properly indented, making it 
appear as though <HEAD> and <BODY> 
are also child nodes of the document root. 
You can accommodate for this by un- 
commenting the else if (isIE5 ...) clause of 
the prettyIndent() function in Listing Two. 

W3C’s DOM specification defines vari- 
ous nodeTypes. Listing Two only distin- 
guishes between Type 1, all HTML tags 
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except CDATA and comments, and non- 
type 1, which include type 3 #text nodes 
among others. (You can read about all the 
node types in W3C’s specification.) You 
can see from Figure 1 that Nav5 and IE5 
treat comments differently. Nav5 observes 
the distinction and labels comments as 
type 8, while IE5 confusingly treats com- 
ments as just another HTML tag, with 
nodeName = '!' 

Just to be fair, I included a <TEXTAREA> 
tag in Example 2 to show that IE5 correctly 
(in my opinion) placed the enclosed text 
as a #text node, while Nav5 placed the text 
in the value attribute (as well as the de- 
Jaultvalue attribute). Yet even though IE5 
doesn’t put the text into <TEXTAREA>’s 
value attribute, you can still retrieve the 
text string from IE5 via document getEle- 
mentByld(txtarea').value. 

You can spend all day using the show- 
DOM(O) function from Listing Two to see 
how IE5 and Nav5 build the DOM and 
document all the little differences. I sug- 
gest you do just that, but for now I'll move 
on to the new DOM and its impact on 
your existing DHTML scripts. 


DHTML and the New DOM 

Just when you've gotten used to handling 
just two browsers (IE4 and Nav4), you 
now have to deal with four (IE4, IES, 
Nav4, and Nav5). So if you’ve been using 
lots of if statements to handle all the 
browsers in your script, you’d better think 
twice. Using this method to handle four 
different browsers will make the script 
needlessly complicated. 

Go back to Example 1. A common 
thing to do with floating layers is to show 
and hide them, move them around, 
change the z-index, and so on. But you’ve 
already seen how different IE4 and Nav4 
can be in terms of accessing the layer/CSS 
object. Now there’s the addition of IES 
and Nav5. Fortunately, 5.0 browsers are 
more alike than different, and for the most 
part, they can be treated similarly. But 
this continues to complicate the script. To 
place layer Z7 10 pixels from the left edge 
of the browser window, you need to ex- 
ecute Example 4. 

This is cumbersome to do every time 
you want to move an absolutely posi- 
tioned element. Imagine if you’re imple- 
menting a DHTML navigation menu wid- 
get with several levels of drop down 
menus. Writing code like Example 4 to 
handle all the CSS manipulation (show, 
hide, change z-index, and do the image 
rollovers) can quickly get out of control. 

Listing Three accomplishes this goal us- 
ing object-oriented Javascript. The con- 
structor function CSSObject() and the 
method prototypes wrap some of the 
browser dependencies with respect to CSS 
and layers within a standard interface. 
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tion: absolut 


id='listing_1' 
id='listing 2' s 


comment just fo 
name='formiName' 
f="javascript:sh 
ly> 





Example 2: Using Listing Two’s scripts to display the DOM of a page similar to 
Example 1. 
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<html> 

<head> - 

<style> 

#Li {position: absolute; left: Opx; top: Opx; z-index: 2;} 


#L2 {position: absolute; left: @Opx; top: Opx; z-index: 1;} 
</style> 
<script language='javascript' src='listing 1.js'></script> 
<script language='javascript'’ srce='listing_3.js'></script> 
<script> 
var objLi, objl2; 
function init() { 

objL1 = new CSSObject('L1'); 

objL2 = new CSSObject('L2'); 

objLi.moveTo(@,15@); 

objL2.moveTo(@,359) ; 
} 
function moveObj(obj) { 

obj.moveBy(10, 9); 

obj.moveBy(10, @); | 
} 
</script> 
</head> 
<body onload='init() '> 
<div id='L1'><img name='Imgi' src='Imagel.gif'></div> 
<div id='L2'><img name='Img2' src='Image2.gif'></div> 
This script should work with IE4, IE5, Nav4, and Nav5.<p> 
Click the buttons below to move the images.<br> 
<form><input type='button' value='move 1' onClick='move0bj(objL1) '> 
<input type='button' value='move 2' onClick='moveQbj(objL2) '> 
</form> 
</body> 
</html> 


Example 3: The DOM in its new altered state. 


if (isNav4) document.layers['L1'].left = 10 
else if (isIE4) document.all|['Li'].style.pixelLeft = 10 
else if (isNav5) document.getElementByIid('L1').style.left = 10 
else if (isIE5) document. getElementBylid('Li').style.pixelLeft = 19; 





Example 4: Placing layer L1. 


<html> 
<head> 
<style> 
#L1 {position: absolute; left: @px; top: Opx; z-index: 2;} 
#L2 {position: absolute; left: @px; top: @px; z-index: 1;} 
</style> 
<script language='javascript' src='listing_1.js'></script> 
<script language='javascript' src='listing_2.js'></script> 
<script language='javascript' src='listing 3.js'></script> 
<script> 
var objLi, objL2; 
function init() { 

objL1 = new CSSObject('L1'); 

objL2 = new CSSObject('L2'); 

objL1i.moveTo(@,150); 

objL2.moveTo(@,258); 
} 
function moveObj(obj) { 
obj.moveBy(1@, @); 
obj.moveBy(19, @); 


} 
function changeLayer(cssObj) { 
essObj.write("an <b>Image</b> use to be here!!"); 


</script> 

</head> 

<body onload='init() '> 

<div id='Li'><img name='Imgi' src='Imagel.gif'></div> 

<div id='L2'><img name='Img2' src='Image2.gif'></div> 

This script should work with IE4, IE5, Nav4, and Nav5 (including Mozilla) <p> 
Click the buttons below to move the images.<br> 

<form><input type='button' value='move 1' onClick='moveObj(objLli1) '> 
<input type='button' value='move 2' onClick='moveObj(objL2)'><br> 
<a href="javascript:changeLayer(objL1)">change 1</a> 

<a href="javascript:changeLayer(objL2)">change 2</a><br> 

<a href="javascript : showDOM()">showDOM() </a> 

</form> 

</body> 

</html> 





Example 5: Implementing the function changeLayer(). 
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Once you've instantiated an object using 
cssobj=new CSSObject(css_layer_id);, you 
can access all the DHTML features via the 
object’s methods without worrying about 
browser versions. Notice that you use doc- 
ument.getElementByld() to get the layer 
element from IE5 and Nav5, and the doc- 
ument.all collection to get the element 
from IE4. But Nav4 presents a special 
problem. While you would like to get the 
element from document.layers, it only has 
a collection of the top-level layers of the 
document. A Nested child layer can only 
be retrieved from the layers collection of 
its parent layer. Therefore, you define the 
function getNav4Layer ), which recursively 
steps through this layers hierarchy to re- 
trieve the desired layer element. 

Retrieving the layer from the document.all 
collection also works for IE5. But you 
should stick to the standard API defined by 
W3C as much as possible, and that means 
using document.getElementByld( ). 

Example 3 is an HTML document that 
creates two floating layers, and clicking 
the Move 1 or Move 2 buttons will move 
them to the right at 10 pixel increments. 
As this illustrates, using the functions in 
Listing Three dramatically reduces the 
code’s complexity. Another obvious ad- 
vantage to this is that if another DHTML- 
capable browser comes along, you only 
need to edit Listing One to detect it and 
Listing Three to handle it. Your HTML 
(such as that in Example 3) should never 
need to change. 

Many programmers have taken advan- 
tage of Nav4’s open(), close(), and write() 
functions to write HTML to a layer element. 
IE4’s equivalent is to set the innerHTML 
property of a CSS-layer element. With IE4, 
however, this works only if you had de- 
fined your CSS-layer with the <DIV> tag 
(as opposed to the nearly equivalent 
<SPAN> tag). This was fixed with IE5, 
which correctly implements innerHTML for 
both <DIV> and <SPAN>. However, in- 
nerHTML does not work at all for IE4 on 
the Macintosh. Likely, most people who 
use this property have decided that IE4 
users on the Mac constitute a negligible 
population. I don’t agree, but enough peo- 
ple are using innerHTML to dynamically 
alter a page’s content so that you should 
show how this can also be done with Nav5. 

The bottom of Listing Three defines a 
write method for the CSSObject class. No- 
tice the complexity of doing this in Nav5 
versus the relative simplicity of accom- 
plishing the same thing in Nav4 and IE5. 
All the methods used in the code are stan- 
dard W3C DOM methods except for 
range.createContextualFragment( ), which 
is a Netscape original. This method does 
as the name suggests— it takes a string 
and creates a document fragment by pars- 
ing the HTML content. The rest of the 
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code just selects the proper place to put 
the fragment, removes the current content, 
and appends the newly parsed fragment 
as children nodes. 

In Example 5, I also implemented the 
function changeLayer(), which takes a 
CSSObject and invokes the write() method 
to change the content of the CSS layer. 
This function can be accessed via the 
Change 1 and Change 2 links. Examine the 
DOM (via the showDOMO( link) before and 
after changing the layers’ contents. Cnci- 
dentally, Erik Arvidsson wrote a white pa- 
per, available at http://webfx.eae.net/dhtml/ 
mozInnerHTML/mozInnerHTML.shtml, 
which shows how you can add the in- 
nerHTML and outerHTML properties to 
Nav5 HTML elements.) 

Lastly, Listing Four shows two Javascript 
functions you can use to retrieve image 
and form objects by name from a docu- 
ment without worrying about the brows- 
er version. All browsers discussed thus far 
use the document.images and docu- 
ment.forms collections proper, with the 
frustrating exception of Nav4, which re- 
quires you to recursively examine each 
layer for the desired object. 

However, even Listing Four doesn’t pre- 
sent a satisfying solution. Since W3C and 
5.0 browsers are moving toward retrieving 
HTML elements by their IDs, I'd like to start 
doing this for forms and images. But nei- 
ther IE4 nor Nav4 can retrieve images and 
forms via IDs, so you aren’t ready to aban- 
don names. And to pass both name and ID 
to the functions just so you can use getEle- 
mentByld for Nav5 and IE5 would seem 
needlessly self indulgent. 


DHTML Site Strategies 

At this point, all the pieces are in place 
for you to start thinking about a cross- 
browser strategy. Here are the steps: 


1. You should remove all extraneous 
spaces and newline characters because 
you can no longer assume browsers will 
ignore them. 

2. Given that these browsers are moving 
towards identifying elements by IDs 
rather than names, you may be inclined 
to start adding IDs to all your HTML 
tags. Don’t do this. In fact adding IDs 
to your HTML may cause pages to 
break on browsers not knowing what 
to do with this additional attribute. For 
example this 


<script id='xyz' language='javascript' 
src='source file url'></script> 


breaks on Nav4, though interestingly the 
following does not. 


<script id='xyz' language='javascript'> 


. some javascript... 
</script> 
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The behavior is erratic enough that you 
would need to do some serious testing 
to document how each browser be- 
haves with each HTML tag when the 
ID attribute is added. (This would be a 
great, although tedious, task for an in- 
tern with nothing better to do.) For now 
I'd suggest using [Ds carefully and only 
on tags when you know it is safe 
(<DIV> and <SPAN>, for instance). 

3. Whenever possible, wrap browser dif- 
ferences within an object or function 
that presents a uniform API as I did 
with CSSObject in Listing Three and 
getImageByName() and getFormBy- 
Name() in Listing Four. 

4. Finally, keep an eye on browser usage 
statistics and start phasing in more ad- 
vanced features of the new DOM as soon 
as Older, problematic browsers (like TE4 
on the Mac and Nav4) drop off the map. 


The timeline is tight. Nav 5 (commer- 
cially packaged as Netscape 6) is practi- 
cally out the door. So start fixing your web 
sites, or start thinking of a good excuse 
to give your clients. 
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Listing One this.elem = document. getElementById(obj) ; 
this.css = this.elem.style; 
// in html examples this bit of code is referred to as file listing_1.js 


// Detect browser version else if(isNav4) { 
var isNav5 = false; this.name = obj; 
var isNav4 = false; this.elem = getNav4Layer(obj, 9); 
var isNav = false; this.css = this.elem; 
var isIE4 = false; } 
var isIE5 = false; else if(isIE4) { 
var isIE = false; this.name = obj; _ 
var isWin = false; this.elem = document.all[obj] ; 
var isMac = false; this.css = this.elem.style; 
} 
if (navigator.appName=="Netscape") { } 
isNav=true; [fas defines the moveBy method of CSSObject ----------------------- 
if(parseInt(navigator.appVersion) == 4) isNav4=true; function moveByNav(x, y) { 
if (parseInt (navigator.appVersion) == 5) isNav5=true; this.css.left = parseInt(this.css.left) + x; 
} this.css.top = parseInt(this.css.top) + y; 
else if(navigator.appName=="Microsoft Internet Explorer") { } 
isIE = true; function moveByIE(x, y) { 
if (navigator.appVersion.indexOf("MSIE 4") != -1) isIE4=true; this.css.pixelLeft += x; 
if (navigator.appVersion.indexOf("MSIE 5") != -1) isIE5=true; this.css.pixelTop t= y; 
} } 
// Detect OS platform if (isNav4 || isNav5) CSSObject.prototype.moveBy = moveByNav 
if (navigator.platform.indexOf("Win") != -1 ) isWin = true; else if (isIE4 |; isIE5) CSSObject.prototype.moveBy = moveBylE; 
if (navigator.platform.indexOf("Mac") != -1) isMac = true; [fp Sees defines the moveTo method of CSSObject ----------------------- 
function moveToNav(x, y) { 
a this.css.left = x; 
Listing Two - eee oe: 
// in html examples this bit of code is referred to as file listing_2.js function moveToIE(x, y) { 
// This function recursively steps through the node heirarchy this.css.pixelLeft = x; 
// and puts the structure in msg. this.css.pixelTop = y; 
function getChildObjects(obj, formatIndent) { } 
var msg = ''; if (isNav4 |; isNav5) CSSObject.prototype.moveTo = moveToNav 
if (formatIndent) { else if (isIE4 |; isIE5) CSSObject.prototype.moveTo = moveTolE; 
msg = formatIndent; fl eseeseese== defines the write method of CSSObject ----------------------- 
} // What follows only works with IE4 on Win (for css objects defined with <div> 
// and not <span>), IES on both Win and Mac, Nav4, and Nav5. 
if (obj.nodeType == 1) { function HTMLWriteNav4(html) { 
// nodeType==1 for all HTML tags other than comments and CDATA. this.css.document.open(); 
msg += "&lt;" + obj.nodeName; this.css.document.write(htm1) ; 
for (var j=0; j<obj.attributes.length; j++) { this.css.document.close() ; 
if (obj.attributes.item(j).nodeValue) { } 
msg t= "" + obj.attributes.item(j) .nodeName function HTMLWriteIE5 (html) { 
+ x"='" + obj.attributes[j].nodeValue + "'"; this.elem.innerHTML = html; 
} } 
} function HTMLWriteNav5 (html) { 
msg t= "&gt;"; var rng = document.createRange() ; 
} rng.selectNodeContents(this.elem) ; 
else { rng.deleteContents() ; 
// W3C's DOM1 spec provides other nodeTypes for text, CDATA, entity var htmlFrag = rng.createContextualFragment (htm1) ; 
// reference, etc. Not all of these are implemented by the browsers. this.elem.appendChild (htmlFrag) ; 
msg += obj.nodeName; } 
if (isNav4) CSSObject.prototype.write = HTMLWriteNav4 
msg += " nodeType: " + obj.nodeType + "\n"; else if (isNav5) CSSObject.prototype.write = HTMLWriteNav5 
for (var i=0; i<obj.childNodes.length; i++) { else if (isIE5 |; (isIE4 && isWin)) 
nextIndent = prettyIndent(obj, formatIndent) ; CSSObject.prototype.write = HTMLWritelIE5; 
msg += getChildObjects(obj.childNodes[i], nextIndent) ; 
e e 
return msg; Listing Four 
// does nothing other than manage the indents for output // in html examples this bit of code is referred to as file listing_4.js 
function prettyIndent(obj, formatIndent) { // returns document's image object of given name. 
if (formatIndent && formatIndent.length>3 && obj.parentNode) { // For Nav4 getImgObjNav4 is called 
nextIndent = formatIndent.substring(®, formatIndent.length-4) ; function getImageByName(imgName) { 
nextIndent += (obj == obj.parentNode.lastChild)? if (isIE4 |; isIE5 |; isNav5) { 
: t-- 8s toa! s return document.images[imgName] ; 
J } 
// uncomment to fix problem caused by <HTML> node not returning a parentNode. else if (isNav4) { 
// else if (isIE5 && obj.nodeName == ' ) return getImgObjNav4(imgName) ; 
// nextIndent = ' +-- '; } 
Hf 3 } 
else { // returns document's form object of given name. 
nextIndent = ' +-- ''; // For Nav4 getFormObjNav4 is called 
} function getFormByName(formName) { 
return nextIndent; if (isIE4 |; isIE5 ;; isNav5) { 
} return document. forms [formName] ; 
// calls getChildObjects and writes the returned msg to the } 
// text area of a new html page. else if (isNav4) { 
function showDOM(domObjId) { return getForm0bjNav4 (formName) ; 
var obj = (domObjId)? document.getElementById(domObjId) : document; } 
var msg = getChildObjects (obj); } 
newHTML = "<HTML><BODY><form><textarea rows=30 cols=120'>" + msg + // steps throug Nav4's layer heirarchy recursively 
"</textarea></form></BODY></HTML>" // to get the image object with given name 
document.write (newHTML) ; function getImgObjNav4(imgName, parent) { 
} var objlImage; 
var parentObj = (parent)? parent : document; 
— objImage = parentObj.images[imgName] ; 
Listing Three if (!objImage) (; 
for (var i=; i<parentObj.layers.length && !objImage; i++) { 
// in html examples this bit of code is referred to as file listing_3.js objImage = getImgObjNav4(imgName, parentObj.layers[i] .document) ; 
// taks a layer id and recursively steps through Nav4's } 
// layer heirarchy. Returns the layer with the specified id. } 
function getNav4Layer(layerId, parent) { return objImage; 
var objLayer; } 
var parentObj = (parent)? parent : document; // steps throug Nav4's layer heirarchy recursively 
for (var i=@; i<parentObj.layers.length && !objLayer; i++) { // to get the form object with given name 
if (parentObj.layers[i].id == layerId) { function getFormObjNav4(formName, parent) { 
objLayer = parentObj.layers[il]; var objForm; 
var parentObj = (parent)? parent : document; 
else { objForm = parentObj.forms[formName] ; 
objLayer = getNav4Layer(layerId, parentObj.layers[i]); if (l!objForm) {; 
for (var i=; i<parentObj.layers.length && !objForm; itt) { 
} objForm = getFormObjNav4(formName, parentObj.layers[i] .document) ; 
return objLayer; } 
} 
// object constructor for CSSObject. return objForm; 
function CSSObject(obj) { 7 


if (isIE5 || isNav5) { 


this.name = obj; DDJ 
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ust as Perl became the duct tape for 

the Web, XML is becoming the duct 

tape for e-business. As a universal data 

format, XML glues together disparate 
e-business systems that, in the process of 
conducting everyday business, need to 
perform hundreds of transactions per sec- 
ond without outages or crashes. Such sys- 
tems need XML processors that provide 
high performance with a small footprint. 
That’s what SAX offers. 

In this article, I’ll describe SAX, then 
show how you can use it in Visual Basic 
applications via the Microsoft XML 
(MSXML) parser. 


SAX: What Is it? 

SAX stands for the “Simple API for XML.” 
Developed in collaboration by the mem- 
bers of the XML community (see http:// 
www.megginson.com/), SAX is a simple, 
efficient, and high-performance alternative 
to the Document Object Model (DOM), a 
widely supported XML API developed by 
the WWW Consortium. Unlike DOM, SAX 
considers XML not as text or an object, but 
as a stream of events, such as: 


e Characters; events that happen when the 
parser reads the content of an element. 

e Start/end elements; an event that hap- 
pens when the parser reads element tags. 

e Processing instructions; an event that 
happens when the parser reads pro- 
cessing instructions. 


Since an application receives these 
events as the parser reads the document, 
there is never a moment when the whole 
document is in memory. As a result, SAX 
delivers better performance and a smaller 
footprint for large files. 


Eldar works in Microsoft's MSXML group. He 
can be contacted at eldarm@microsoft.com. 
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SAX2: A Simple 
API for XML 


XML without overhead in Visual Basic 


Eldar A. Musayev 


How Does it Work? 
To use SAX, you create event handlers, 
plug them into the parser, and call the 
parser. After that, the parser calls the nec- 
essary event handlers while parsing XML 
documents. Figure 1 illustrates how this 
relationship works. 

The SAX2 API currently has five event 
handler interfaces, each responsible for a 
separate group of events: 


¢ ContentHandler, which reports events 
related to the content of an XML doc- 
ument. 

e DTDHandler, which reports some events 
related to a document type definition 
(DTD): 

e ErrorHandler, which reports encoun- 
tered errors. 

e LexicalHandler, which reports more 
fine-grained lexical events, like com- 
ments, that the content handler ignores. 

e DeclHandler, which reports declarations 
within a DTD. 


To use these interfaces, you just im- 
plement the ones you need within your 
code and register them with the parser. 

Another interesting SAX feature is XMI- 
Filter, which is both a reader and a group 
of event handlers. XMLFilter receives events 
from the reader, does some work based 
upon those events, and then passes se- 
lected events to the next handler. Hence, 
each event calls a number of event han- 
dlers chained one after another. This lets 
you break XML processing in logical com- 
ponents, such as check inventory, compute 
tax, or charge credit card, as in Figure 2. 


Down to the Code 

In the following discussion, I'll implement 
a simple application that performs a key- 
word search based upon the component 
approach that SAX and XMLFilter provide. 
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The complete Visual Basic source code 
for this application is available electroni- 
cally; see “Resource Center,” page 5. There 
you will find three different implementa- 
tions of this task, showing three different 
ways to handle and connect SAX com- 
ponents. 

To keep the code manageable, the ap- 
plication’s task is to search an XML doc- 
ument (see Listing One) for those 
<book> elements whose title attribute 
contains the specified keyword, and then 
output a list of those books. Notice that 
in the XML document, most of a book’s 
details are attributes of the <book> ele- 
ment. The one exception is the book’s 
description, which forms the <book> el- 
ement’s content. 

As far as coding goes, this application 
has three clear phases: reading the XML 
document, filtering the document’s con- 
tents by keyword, and outputting the re- 
sults. Each of these phases correlates to 
a separate SAX component in the sam- 
ple applications that are available elec- 
tronically. (For these sample applications, 
you need the October 2000 MSXML Mi- 
crosoft XML Parser 3.0 release, available 
at http://msdn.microsoft.com/xml.) 


Using Visual Basic 

The first thing you need to do is create 
the necessary SAX components: a read- 
er, filter, and writer. For the reader, you 
can use the MSXML SAXXMLReader. You 
will need to implement the filter and 
writer components yourself, as described 
later. 


Dim reader As New SAXXMLReader 
Dim filter As New KeywordOnly 
Dim writer As New PrintOut 


Having created the components, you 
now need to set parameters and connect 
these components to each other: 
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Figure 1: Using SAX. 


(continued from page 130) 
TextBox1.text =" 
filter.searchKey = TextKey.text 
Set filter IVBSAXXMLFilter_parent = reader 
Set filter IVBSAXXMLReader_contentHandler 
= writer 


Alternatively, you could set the parent 
and contentHandler properties using type 
casting. In fact, type casting is a more 
proper way of setting these properties be- 
cause it really sets the properties of the 
filter interface instead of setting them via 
a pseudoproperty of an implemented class. 
This code uses type casting to set the par- 


ent property: 


Dim chf As IVBSAXXMLFilter 
Set chf = contentHandler 
Set chf.parent = reader 


Electronically available source code (see 
“Resource Center,” page 5) provides im- 
plementation for both direct setting of 
properties (the “complete filter’ example) 
and use of type casting (the “complete fil- 
ter type casting” example). 

Type casting, though more correct, also 
requires more coding. Type casting re- 
quires three lines of code to set the prop- 
erty; while using a pseudoproperty takes 
only one. 

Why do you need to set the parent 
property for the filter? Because of the fil- 
ter’s dual functionality: as a content han- 





Figure 2: Breaking XML processing 
into logical components. 
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dler for the reader, and reader for the 
writer. Setting the parent property not 
only sets the filter as the content han- 
dler for the reader, but also lets the fil- 
ter represent itself as a proxy reader by 
redirecting its own calls, such as 
“parse(),” to the real reader. 

Once you have created all the objects, 
linked them together, and set the pa- 
rameters, you can parse the document: 


On Error GoTo Uh_Oh 
filter IVBSAXXMLReader_parseURL 
TextFile.text 

Exit Sub 
Uh_Oh: 

TextBox1.text = TextBox1.text & 

"'« Error in XML file with the list of 
books *** " 


MSXML implements its SAX interfaces 
based upon COM/OLE, and COM/OLE in- 
terfaces start with “I.” So, MSXML added 
prefixes to the original SAX interface 
names: “I” for the high-performance COM 
interfaces used by C++, and “IVB” for the 
automatable interfaces used by Visual Ba- 
sic and scripting. 

Yet, MSXML uses no prefixes for its 
classes because those classes usually im- 
plement both interfaces (that is, there is 
no need for a separate “VB” prefix). A sin- 
gle coclass works for both C++ and Visu- 
al Basic. 

However, there are two names for 
each of these coclasses. This naming has 
nothing to do with programming lan- 
guages, but instead with versioning con- 
trol. For example, both C++ and Visual 
Basic can use either the SAXXMLReader 
and SAXXMLReader30 coclass. The dif- 
ference is that SAXXMLReader coclass is 
version independent and SAXXMLRead- 
er30 isn’t. Version independence means 


ln, ¥iCompietef iter ~ Mrrosoft Visual Basic [design] 


that the SAXXMLReader coclass auto- 
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matically uses the latest installed version 
of MSXML, while the version- dependent 
SAXXMLReader30 coclass continues to 
use only MSXML 3.0. 


Implementing the ContentHandler 

The PrintOut class implements the con- 
tent handler and prints everything out. To 
implement the content handler, open Vi- 
sual Basic and create a class module 
named PrintOut. 

Now, let Visual Basic know which in- 
terface you are going to implement: J/m- 
plements IVBSAXContentHandler. Next, 
add dummy implementations for all the 
necessary methods. To do that, just select 
the interface from the Object list (shown 
in the red circle in Figure 3) and then se- 
lect each of the methods, one-by-one, 
from the Procedure list (highlighted with 
a red arrow in Figure 3). 

You can now implement the output. In 
Listing Two, you direct the output to a 
text box on the form. This output hap- 
pens only from the startElement and char- 
acters methods. 


Getting Attribute Values 

In the original SAX specification, the con- 
tent handler returns Null if an attribute is 
not found. However, in Visual Basic, an 
empty string and Null are identical. Thus, 
MSXML’s implementation of this interface 
raises a trappable error instead of return- 
ing Null. 

To trap and process this error, the ex- 
ample code uses the attrVal helper 
method. This helper method (function) 
also provides some small conveniences 
like a default value and a prefix/suffix. 


Implementing the Filter 
The main function of the filter is to re- 
ceive SAX events and pass the selected 
ones on. Sometimes, it may change the 
content of XML as well, but in this exam- 
ple it does not. 

The filter should implement three in- 
terfaces (specifying a library is not nec- 
essary, but somewhat useful for read- 


ability): 


Implements MSXML2.IVBSAXContentHandler 
Implements MSXML2.IVBSAXXMLFilter 
Implements MSXML2.IVBSAXXMLReader 


Z strChars String} 


PPcORL. text € Text 
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Figure 3: Select the interface from the Object list, then selecting each of the 


methods from the Procedure Jist. 
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The only property of the filter interface 
is parent; see Listing Three. While it does 
not matter here whether you make the 
setter and getter private or public, making 
them public puts them in the interfaces 
pop-up list that appears when you im- 
plement interfaces in Visual Basic. 

You use similar code (see Listing Four) 
for the SAX reader property, content- 
Handler. You then set all other methods 
of the SAX reader interface to delegate the 
call to the parent (if there is a parent); see 
Listing Five. Also, you need to set the con- 
tent handler to track elements, and turn 
the pass-through flag on for elements with 
selected keyword and off after exiting the 
element; see Listing Six. Finally, you need 
to set all other content handler methods 
to simply pass the event to the next con- 
tent handler (provided that handler ex- 
ists); see Listing Seven. 


Shortcut Filter Creation 

There is a shortcut for implementing all 
filter/reader methods. This shortcut also 
helps to avoid type casting in the main 


program. For this shortcut, you imple- 
ment only the content handler and add 
the contentHandler property directly to 
the KeywordOnly class (that is, ‘Set read- 
er.contentHandler = filter’). The code 
for such implementation is also available 
electronically as a “simple filter” exam- 
ple (see “Resource Center,” page 5). 
However, the convenience of this short- 
cut comes at a price: 


e If you need other handlers (like a Lex- 
icalHandler or ErrorHandler), you will 
have to set them manually. 

e You cannot pass your new filter as a pa- 
rameter to a function that is expecting 
a reader to be passed to it. 


Conclusion 

The same MSXML SAX parser may be used 
in C++, with even greater gains in per- 
formance, by using a separate set of in- 
terfaces USAXContentHandler, ISAXDTD- 
Handler, and others). However, a complete 
description of this would require a sepa- 
rate article. 





SAX is often a simple and efficient way 
to read XML. One of the most compelling 
reasons to use SAX is when you cannot 
afford the load to the system associated 
with using the DOM. For example, there 
is no way to use the DOM (at least not 
directly) if your documents are so huge 
(many megabytes in size) that they just 
don’t fit into memory. Sometimes they 
may fit, but not enough for multithread- 
ed processing of several copies at the 
same time, which is usual for web-based 
applications. 

Another reason to choose SAX may 
not be necessity, but comparison. Sup- 
pose your Windows 2000 web server 
processes a high volume of average-sized 
XML messages. For MSXML, using SAX 
is normally several times faster than us- 
ing the DOM. For relatively simple ap- 
plications, the faster processing achieved 
by using SAX can translate into the abil- 
ity to serve twice as many clients at the 
same time. 
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Listing One 


<?xml version="1.8"?> 
<booklist> 
<book 


If Not IsEmpty(parent) Then: parent.parseURL strURL 


End Sub 


Listing Six 


title="Building Microsoft Exchange Applications (Solution Developer Series)" 


author="Peter J. Krebs" 
ISBN="157231334xX" 
instock="no"> 


This book will guide programmers and non-programmers to create a professional 
mail-enabled or groupware application in less than a day with Microsoft 


Exchange. 
</book> 


</booklist> 


Listing Two 


Private Sub IVBSAXContentHandler_startElement(...parameters skipped ...) 
Formi.TextBox1.text = Form1l.TextBoxl.text _ 
& vbNewLine & attrVal(attributes, "title") _ 
& attrVal (attributes, “author",. "", “ by ") ™) = 


& vbNewLine & "-- "_ 
. other attributes .. 


Private PutItOut As Boolean 
Private Sub IVBSAXContentHandler_startElement(... 


attributes skipped...) 


Dim i As Integer, s As String 


If strQName = "book" Then 
On Error GoTo noTitle 


s = attributes. getValueFromQName ("title") 


If InStr(s, 
End If 
noTitle: 


On Error GoTo @ 


If PutItOut Then 


searchKey) > @ Then 
PutItOut = True 


If Not IsEmpty(ch) Then 


ch.startElement strNamespaceURI,strLocalName, strQName, attributes 


End If 
End If 
End If 
End Sub 


& attrVal(attributes, "instock", " No inventory data.",_ 


End Sub 


Private Sub IVBSAXContentHandler_characters(strChars As String) 
Formi.TextBoxi.text = Forml.TextBoxi.text & text 


End Sub 


Listing Three 


Private parent As MSXML2.IVBSAXXMLReader 
Public Property Set IVBSAXXMLFilter_parent _ 


( ByVal RHS As MSXML2.IVBSAXXMLReader) 


Set parent = RHS 
Set RHS.contentHandler = Me 
End Property 


Public Property Get IVBSAXXMLFilter_parent() As MSXML2.IVBSAXXMLReader 


IVBSAXXMLFilter_parent = parent 
End Property 


Listing Four 


Public ch As IVBSAXContentHandlerPrivate Property Set _ 


, In stock: ", ".") 


Private Sub IVBSAXContentHandler_endElement (strNamespaceURI As String, _ 


strLocalName As String, strQName As String) 


If PutItOut And Not IsEmpty(ch) Then 


ch.endElement strNamespaceURI, strLocalName, strQName 


End If 


If strQName = "book" Then 


PutItOut = False 
End If 
End Sub 


Listing Seven 


Private Sub IVBSAXContentHandler_characters(text As String) 
If PutItOut And Not IsEmpty(ch) Then 


End If 
End Sub 


IVBSAXXMLReader_contentHandler (ByVal RHS As MSXML2.IVBSAXContentHandler) 


Set ch = RHS 
End Property 


Public Property Get IVBSAXXMLReader_contentHandler() _ 


As MSXML2.IVBSAXContentHandler 


IVBSAXXMLReader_contentHandler = ch 
End Property 


Listing Five 


Public Sub IVBSAXXMLReader_parseURL(ByVal strURL As String) 
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PROGRAMMING PARADIGMS 


Alternatives to Java 


Michael Swaine 


n ironic deference to this month’s 
theme, I have titled this month’s col- 
umn “Alternatives to Java.” I chose that 
title deliberately for its ambiguity and 
political slipperiness. It’s one of those 
phrases whose meaning depends on 
who’s uttering it, like “dimpled chad.” 
Here’s an utterance from one of the most 
Java-centric of companies: “IBM says that 
if Sun doesn’t honor the ‘verbal commit- 
ment’ it made to the industry and to IBM 
in 1995 to turn Java over to a third-party 
standards organization, then an alterna- 
tive to Java will be found.” (ClieNT Serv- 
er News, April 2000). If that’s what Java- 
friendly IBM says about alternatives to 
Java, what about Java-hostile Microsoft? 


A Choice of Beverages 

An agreement between Hewlett-Packard 
and Microsoft could blast a permanent fis- 
sure down the middle of the Java market. 
At the center of the negotiations is Chai, 
HP’s clean-room implementation of an 
embedded Java Virtual Machine, but a 
whole shopping list of Hewlett-Packard 
and Microsoft and other technologies is at 
issue, including e-speak, UDDI, XML, 
SOAP, C#, and .NET. Microsoft needs a 
Java solution for .NET; customers will de- 
mand it. But Microsoft’s dicey relationship 
with Sun raises problems for Microsoft. 
Enter HP with Chai. For Microsoft, Chai 
represents the benefits of Java without the 
aggravation. 

The idea is that Microsoft would build 
Chai into.NET. This would open .NET 
servers and Internet-based services to non- 
Windows Java apps and Java-based devices. 
It would do so, however, by using a JVM 
that is independent of Sun and has been 
described as “the furthest thing from Sun’s 
Java.” And that’s what has some people 
thinking that this could be the wedge that 
effectively splits Java and finally demolish- 
es the “write once, run anywhere” dream. 

Actually, Microsoft has had a license for 
Chai since 1998. Its license for Java from 
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Sun dates back considerably farther, but 
that’s been the subject of as much legal 
and political maneuvering as the Florida 
recount. Chai, on the other hand, is shap- 
ing up to be the keystone of a complex 
deal structure that would, among other 
things, resolve an XML technology issue 
over which Microsoft and HP are at odds. 

That controversy involves HP’s e-speak 
and Microsoft/IBM/Ariba’s UDDI (Uni- 
versal Description Discovery Integration), 
two overlapping specifications relating 
to registry, transaction rules, and busi- 
ness directory structure for b-to-b e- 
commerce. It appears, at the time I am 
writing this, that Microsoft and HP may 
agree to some compromise on e-speak 
and UDDI as part of the Chai/.NET deal. 
By the time you read this, it may be a 
done deal— although, at the time I am 
writing this, the U.S. presidential election 
is still undecided, so I’m not placing any 
bets on anybody coming to any agree- 
ment about anything. 

Another item on that shopping list of 
technologies that Microsoft and HP are 
negotiating is Microsoft’s C# language, 
sometimes portrayed as an alternative to 
Java. Microsoft wants to secure ISO cer- 
tification for C# (as well as for .NET), and 
HP’s support would be very helpful in 
that effort. HP wants the same for e-speak 
and Chai. 

Even without ISO endorsement, Mi- 
crosoft and HP could arguably establish 
Chai as a competing de facto Java stan- 
dard, evolving it farther from Sun’s de fac- 
to standard and fragmenting the market. 
Who was it who said that when two in- 
compatible standards compete, each be- 
comes a substandard? 


C# in the Exit Polls 

Back to C#: How plausible is the view of 
C# as an alternative to Java? One measure 
of that is developers’ willingness to try out 
this Microsoft offering. The research firm 
Evans Data polled developers about C#, 
with the following results: Of the 600 de- 
velopers polled last October, 30 percent 
said they would probably try out C# this 
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year. But just who are these developers? 
Well, two-thirds of that 30 percent are 
people who use Visual Basic more than 
half the time. Among heavy Java users, 
only one in 10 expressed any interest in 
trying out C# this year. Based solely on 
these results, it would seem that C# is, 
at least for now, less of a Java killer than 
a VB killer. 

That’s not the end of the story, howev- 
er. C# is very young: Tools and compilers 
are not yet publicly available. The lan- 
guage was designed to provide Java’s main 
benefits and will certainly challenge Java 
to some degree as it matures. 


Not a Banner Year for Java? 
One particular market for Java apps may be 
slipping away— rich media web banners. 

It was First Virtual that first introduced 
these rich media banners in 1996, using 
Java. First Virtual asserted that Java was 
the superior technology for such banner 
ads and win out over all competitors. It 
hasn’t worked out that way. In this mar- 
ket, Macromedia Flash is emerging as the 
winning technology. As of a year ago, 
more than half of interactive service shops 
had settled on Flash as their top choice 
for a tool for such work, according to a 
Jupiter survey. 

Alternatives to Java? Certainly, there 
are viable alternatives to Java mounting 
serious challenges to Java in particular 
markets. 


Third-Party Candidates Come in Third 
Third-party candidates don’t generally 
do well in America, probably because of 
a self-defeating feedback loop: They 
aren’t perceived as being winners, so 
they don’t get the support they’d need 
to become winners. Americans do like 
to back a winner. 

Minority solutions in software can pros- 
per even if they don’t win a majority of 
the electoral vote, though. The following 
development environments or tools have 
been pitched as alternatives to Java, by 
which the pitchers mean many different 
things. They’re all good tools, but is there 
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any sense to their specific claims to be 
alternatives to Java? 


e Internet C++. Xoom is promoting its C++ 
implementation as an alternative to Java 
and C#. “Build one C/C++ program — 
run it everywhere” is their slogan, and 
it certainly catches one’s attention. The 
key is the virtual machine, the ICVM. 
Xoom invites you to take your standard 
32-bit C/C++ programs and recompile 
them for use on the Internet, using 
ICVM as the execution base. The target 
market is people developing 3D online 
games. The pitch is Standard C/C++ with 
the portability benefits of Java. 

e Lisp. A little over a year ago, a study at 
JPL compared Java, C++, and Lisp im- 
plementations on a variety of criteria. 
The results go a long way toward de- 
bunking conventional wisdom that Lisp 
is slow. The results show Lisp perfor- 
mance as comparable to or better than 
C++, with significantly lower variability. 
The researchers argue that this reduced 
variability translates into reduced pro- 
ject risk, which may be a good point, 
but it speaks more to Lisp as an alter- 
native to C/C++ than as an alternative 
to Java. Their argument for Lisp as a Java 
alternative runs as follows: Lisp, like Java, 


provides automatic memory manage- 
ment, dynamic object-oriented pro- 
gramming, and portability. Those are 
Java’s purported benefits over C/C++. 
Lisp has all those benefits without the 
performance drawbacks of Java. Hence, 
it is a viable alternative to Java for many 
applications. 

Perl. Larry Wall, Perl’s creator, gave a 
talk some time ago titled “Perl: An Al- 
ternative to Java?” The claim here is 
straightforward, but limited. Perl is prob- 
ably the most widely used language for 
web programming (read “CGI pro- 
gramming”) and has the portability ad- 
vantages attributed to Java. But there 
are a number of other interpreted script- 
ing systems that, while perhaps not as 
popular as Perl, can make a similar 
claim to portability. What they can’t 
claim, as Perl can’t, are the benefits of 
a real, full programming language. On 
the other hand, Perl is so popular now 
that we could almost expect a speech — 
probably not by Larry Wall—titled 
“Java: An Alternative to Perl?” 

Juice. Juice is an alternative to Java 
specifically in the area of downloadable 
applets, particularly large applets, rather 
than in other Java-targeted markets like 
consumer electronics and household ap- 


Paradigms Past: History in the Making 


material that is necessary to collect, 

organize, and present for a history to 
work. Up to a point, a partial history 
isn't really a history at all. Only when 
you get close to an exhaustive survey of 
the subject do you have a chance of 
treating the material in a fair, accurate, 
balanced manner. 

But because each of the processes of 
collecting original material, doing inter- 
views, and writing lead to the discovery 
of new areas that need to be included (a 
process that can expand infinitely unless 
you find a logical way to curtail it), you 
find yourself early in the process with an 
unbalanced, unsatisfactory collection of 
material whose significance you may sus- 
pect, but you can’t properly articulate. 
The pieces may be interesting on their 
own, but their connection with one an- 
other and their significance in the big pic- 
ture are unclear. 

I'd like to draw your attention to such 
an unsatisfactory collection of material. 
At http://library.stanford.edu/mac/index 
-html you will find Alex Soojung-Kim 
Pang’s Making the Macintosh project. It’s 
good work, and it’s only unsatisfactory 
in the sense that I just articulated — it’s 
unbalanced because it’s not finished. 


iE is always some critical mass of 
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Among its jewels, the site has a num- 
ber of interviews with some people who 
know a lot about the creation of the Mac- 
intosh. Jef Raskin, for example. Now, Jef 
certainly has knowledge that no one else 
has about the origins of the Mac: He named 
it, he started the project, he is responsible 
for some key decisions in its early history. 
You can’t tell the Mac story without Jef’s 
input. Also, Jef is articulate, opinionated, 
and has thought long and hard and pro- 
ductively about user interface design. 

However, Jef’s view of what the Mac- 


intosh was supposed to be bears little 


resemblance to what it became after 
Steve Jobs elbowed him out of the pic- 
ture and took over the Macintosh pro- 
ject. Jef’s perspective might be the one 
clear vision of naked truth, but it’s not 
an unchallenged view. The site needs a 
balancing perspective. In this sense, the 
site provides depth without breadth. 
That's not to take anything away from 
the excellent content there now. 

It’s a nudge to those of you who 
know a piece of the story. This is a well- 
begun project, and with input from more 
of the people who know, it could be 
richer and fuller and more truly a work 
of computer history. 

— M.S. 
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pliances. Juice is based on Niklaus 
Wirth’s language Oberon. Juice is to 
Oberon as JVM is to Java. Most of the 
key work on Juice actually predates the 
release of Java. Juice’s inventor claims 
that his tree-structured program repre- 
sentation has significant advantages over 
Java’s byte-code representation. 

¢ Component Pascal. Speaking of Niklaus 
Wirth-inspired languages, Component 
Pascal has been mooted as an alternative 
to Java. Component Pascal is a consider- 
ably smaller, less complex language than 
Java. But like Java, it supports the dy- 
namic loading of code and metapro- 
gramming (reflection). In fact, it is suffi- 
ciently Java-like that there is a Component 
Pascal compiler that actually produces 
standard Java bytecode class files. 


Clearly different people mean different 
things by “alternative to Java.” Probably 
the only interpretation that carries any im- 
pact is “Java-killer.” Certainly, you might 
choose to use Perl for a job that another 
developer would tackle using Java, but 
that’s no threat to the existence of Java. 
Some would argue that there are only two 
real threats to Java’s survival: Microsoft 
and Sun. 

In December, Sun appointed new mem- 
bers of the executive board of the Java 
Community Process— members who do 
not work for Sun. Arguably, this is evi- 
dence that Sun is loosening its grip on the 
Java technology spec by an erg or so. It 
ain’t easy for Sun. You can almost feel the 
strain as the company pries its own cor- 
porate fingers back from the deathgrip on 
its technology. 


FileMaker: The Downgrade 

Last month, I wrote about our efforts here 
at The Prose Garden to use FileMaker Pro 
as the database centerpiece of the e- 
commerce solution we are developing for 
my wife’s farm/restaurant/andsoforth busi- 
ness, Summer Jo’s. I was also, at the same 
time, trying to say something useful about 
the FileMaker Pro 5 family of database 
products. I had hoped to go into a little 
bit of detail about our experience this 
month, and in the process give readers a 
sense of the strengths and weaknesses of 
FileMaker Pro 5. 

However, we are no longer using File- 
Maker Pro 5, and as a result, I can’t write 
too much about that release just now. I 
don’t want to suggest too much by our 
decision to drop back to Version 4.1. Al- 
though there was a security issue that in- 
fluenced our decision, the deciding factor 
was simple compatibility with our service 
provider, who is using Version 4. The up- 
shot, though, is that I haven’t looked at 
FMP 5 in the past month in sufficient 
depth to say anything useful about it. That 
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may change by next month. But whether 
it’s Version 4 or Version 5, FileMaker cer- 
tainly lets one knock out quick solutions — 
and that includes web-based e-commerce 
solutions. The scripting language built into 
FileMaker keeps the ease-of-use versus 
power balance from tipping too far from 
power, but it would be even nicer if the 
scripting language really were a scripting 
language. 

What it is, unfortunately, is something 
less. ’'ve been scripting FileMaker and 
reading the bible for FileMaker scripters. 
That would be Scriptology: FileMaker Pro 
Demystified, by Matt Petrowsky and John 
Mark Osborne (ISO Publishing, 1998, ISBN 
0-9660876-0-7). I have a few things to say 
about the scripting experience and the 
book. ScriptMaker is the programming com- 
ponent of FileMaker. It’s really just a macro 
generator. It does have logical operators and 
looping and branching statements and the 
ability for one script to call another. But it 
only automates manual operations that a 
FileMaker user/developer can perform, and 
despite what Petrowsky and Osborne say, 
it is not an object-oriented development 
system, or even a quasi object-oriented 
system. 

Petrowsky and Osborne’s book covers 
the ground it needs to cover: If you want 
to do anything with the scripting capabil- 


ity of FileMaker Pro, the book is almost 
mandatory for its documentation of the 
script language elements (called “steps”). 
It goes well beyond just documenting the 


Different people 
mean different 
things by 
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Java” 





steps, though: It includes a CD of useful 
scripts. I didn’t find them very usefully in- 
dexed; finding a script to solve a particu- 
lar problem is harder than it needs to be. 
But the scripts included have been cho- 
sen intelligently, and practicing FileMak- 
er developers should find many of them 
useful. 





The book also threads its way through 
the trickier parts of FileMaker, like cross- 
platform issues and using portals, from a 
scripting perspective. I understand why a 
developer at the FileMaker Developer’s 
Conference last summer told me this was 
a must-have book. One area that is han- 
dled particularly well is optimization. This 
material is written from a solid practical 
knowledge of the peculiarities of File- 
Maker. Still... 

I found the book less than ideally or- 
ganized, with typos and grammatical er- 
rors. I think I know why. On page iii of 
the book is the list of credits, and here we 
find that Osborne and Petrowsky wear a 
lot of hats. Managing editor, editorial su- 
pervisor, production director, production 
editors, technical editors, layout designer, 
cover designer, CD-ROM content devel- 
oper, interface developer, copy editor, 
product testers, and production team — it 
is possible to spread oneself too thin. Fine, 
boys, be your own CD-ROM content de- 
velopers, but if you copyedit your own 
writing and act as your own technical ed- 
itors— you know how they say that a man 
who acts as his own lawyer has a fool for 
a client? You might take it as a hint as you 
develop the next edition. 
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end users and service providers’ marketing 
and operations professionals to do just that. 
We are looking for developers, software 
engineers and pre-sales consultants to help 
us evolve and deploy our products in the 


optical, DSL and cable spaces. 


We have offices in two of the most 
stimulating technology centers in the world, 
Boulder, CO and Boston, MA. If you are 
ready to be a member of a first class team, 
contact hrke@emperative.com or visit our 
website www.emperative.com to see our 
current job postings. We'd love to hear 


from you. Refer to code # DRD-01-01 
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Changing Soapboxes Midstream 


Al Stevens 


recently ported Quincy 2000, the current 

ongoing column project, to the Enter- 

prise Edition of Visual Studio 6.0’s Visu- 

al C++. The CD-ROM for my copy of Ver- 
sion 5.0 was damaged, and I could not 
reinstall the compiler following recovery 
from a hard disk crash, so I decided to up- 
grade. Version 5.0 has some unresolved is- 
sues, mostly with respect to Standard C++ 
compliance, and I looked forward to see- 
ing how 6.0 addresses these issues. 

Converting a major application to a new 
compiler version sometimes involves prob- 
lems, particularly when the language def- 
inition has changed in the meantime, but 
this one was a breeze. There were only 
two things to change. Quincy used an ob- 
ject of the std::set<T>.:iterator class ini- 
tialized by and compared with the 
std::set<T>::end() member function that 
is no longer supported. I had to change 
the code to use the const_iterator class. 
The second change was because VCt++ 
6.0 issues a warning if you use the new 
operator to instantiate an object without 
assigning the returned address to a point- 
er when the class constructor does not 
specify the throw() qualifier. VC++ warns 
you that memory leaks might occur if the 
constructor throws an exception. Adding 
the throw() qualifier to the constructor’s 
signature removes the warning. 

I was disappointed to see that whereas 
VC++ supports the new~-style Standard C++ 
headers for Standard C functions (<cstdio>, 
<cstring>, and so on) and did in prior ver- 
sions, it still does not place the Standard C 
functions in the std namespace when you 
include those headers. There is no good 
reason why the newest version of a con- 
temporary C++ compiler fails to comply 
with the Standard in this respect. The 
framers of the Standard allowed for con- 
tinued support for legacy code by not pro- 
hibiting a complying compiler from sup- 
porting the legacy header format (<sidio.h>, 
<string.h>, and the like), which leaves the 
Standard C functions in the global name- 
space. The other, newer format should be 
fully conforming, and it is not in VC++. 

My understanding is that Microsoft’s rep- 
resentatives on X3J16 opposed putting the 
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C functions into std. Irrespective of the 
merit of their arguments, the committee 
passed the measure by majority vote, and 
Microsoft, being a willing participant in 
that particular democratic process, should 
ensure that its compiler complies. 

I sincerely hope that VC++ 7.0 will cor- 
rect these deficiencies, that it’s all a matter 
of priorities. Until I know otherwise, my 
only recourse is to assume that since VCt++ 
is the dominant C++ development envi- 
ronment for Windows, which is the domi- 
nant operating environment, all of which is 
controlled by Microsoft, the Microsoft de- 
velopers think they can do what they want. 
In this case, they want to defy the will of 
the majority and go their own way. So what 
else is new? This attitude reminded me of 
the Microsoft Windows-only Java imple- 
mentation, which started a lawsuit over li- 
cense violation, which got me thinking 
about Microsoft business practices, current 
events, and how it all affects programmers. 

Brace yourself for some Microsoft bash- 
ing, the industry’s most popular indoor 
sport, but one that I, until now, have not 
participated in with much enthusiasm. 


Breaking Up Is Still Hard to Do 
A while back, I characterized the Microsoft 
breakup ruling as being punitive and un- 
necessary. Since writing that opinion, I’ve 
read more about the case and the events 
leading up to the ruling, and I’ve changed 
my opinion— not the part about it being 
punitive, I still believe that. But I’ve revised 
my thinking about the necessity of the ac- 
tion, particularly as it affects programmers. 

My earlier attitude reflected what I per- 
ceived was best for programmers and com- 
puter users, which I can describe this way: 
A single, common operating environment 
best serves the majority of computer users 
who are relatively unsophisticated and who 
need and care only about having something 
productive and easy to use. An operating 
environment platform with a common API 
that implements a common user interface 
is also best for programmers who need a 
solid foundation upon which to build reli- 
able, usable applications— a foundation 
that users embrace so that a market exists 
for those applications. 

Given a dominant operating environ- 
ment, even when there are not competing 
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versions from competing companies, it is 
convenient to conclude that whatever busi- 
ness and technology model ensures the 
success of that environment and its com- 
pany, monopolies notwithstanding, is best 
for users and applications developers. 

In other words, to borrow cartoonist Al 
Capp’s parody of an infamous quote of 
the 1950s, “What’s good for General Bull- 
moose is good for the country,” implying 
that when you look out for the interests 
of big business, the people automatically 
benefit from trickled-down advantages. 
Of course, centuries of industrial misdeeds 
necessitating social reform and legal reme- 
dies have proven just the opposite. 

My earlier opinions were reinforced by 
the observation that the Department of 
Justice (DOJ) lawyers and the bench did 
not seem to understand the technical is- 
sues that underpin some of their objec- 
tions to Microsoft’s actions, and that Micro- 
soft’s defense team, including Microsoft’s 
technical and management witnesses, did 
an abysmal job of explaining those issues. 

Case in point: One of the DOJ’s charges 
was that by “tying” Internet Explorer (for- 
merly a standalone program) to Windows, 
the OS that virtually everyone uses, which 
unquestionably constitutes an operating- 
system monopoly, Microsoft applied 
predatory practices to eliminate browser 
competition, essentially putting Netscape’s 
browser out of the running. 

In the first place, IE and Netscape Navi- 
gator are not really products— they are 
gifts, free to anyone who wants to down- 
load them. They exist as incentives for users 
to use the companies’ real products. In the 
second place, we must view the desktop 
from a larger perspective. Here is where I 
think the defense really screwed up. If I 
had been called as a technical witness, and 
if those technically challenged lawyers 
would have asked the right questions, I 
would have offered an explanation but with 
this qualification: The litigators and the 
bench need to be more than casual com- 
puter users to understand these points. If 
his honor and learned counsel do not un- 
derstand the nature of desktops, icons, fold- 
ers, file systems, subdirectories, documents, 
applications, URLs, and such, they are un- 
qualified to make this case or rule on it, 
should recuse themselves, and should find 
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qualified people to take over. Of course, at 
that point I would have found myself in- 
vited off the witness stand and out of the 
courtroom, probably with a contempt cita- 
tion, which would have been appropriate. 

Nonetheless, here is the argument I 
think the defense should have made. If 
they made it (I have not read the com- 
plete transcript), they either made it inef- 
fectively, made it to those who did not 
understand, or made it to those who al- 
ready had their minds made up. 

A browser on a desktop is not really an 
application. Things on the desktop typi- 
cally represent locations where other things 
can be found. Among those things are mass 
storage devices and subdirectories within 
mass storage devices. Some of those de- 
vices are connected to the local computer; 
others are connected to other computers 
in the local network; others still are con- 
nected to remote computers that are part 
of the Internet. You might argue that by 
hosting HTML documents, encryption, and 
Java applets, the browser is more of an ap- 
plication than Windows Explorer, which 
only displays the hierarchy of file systems. 
The browser does add functionality, but 
that functionality is limited. You don’t mod- 
ify or update the content you view except 
by passing data to a real application in the 
server, and browser-launched Java applets 
don’t do anything important to your com- 
puter’s local contents. The browser’s main 
purpose is to let users view things on the 
Internet and send things to servers, which 
can be seen as littlhe more than an exten- 
sion of the local computer's operating en- 
vironment (desktop or command line), 
which is, indeed, a function of the OS. 

Microsoft did not tie an application to 
the OS; Microsoft extended the OS to in- 
clude integrated Internet browsing. If that’s 
tying products, then so is adding support 
for LANs, CD-ROMs, and DVDs. 

But even if you insist that IE is an appli- 
cation, one of the tests litigators apply to 
determine if tying two products constitutes 
unfair monopolistic practices is whether 
consumers would gain equal benefits by 
acquiring the products separately. Microsoft’s 
technical witness really blew it when he 
testified they would. The value of IE over 
Netscape is its degree of integration with 
Windows Explorer, which no one has sug- 
gested should not be a part of the OS. The 
benefit gained from an integrated browser 
is the seamless integration of the user’s ac- 
cess to content in a unified user interface. 
You look at a local hard disk’s subdirecto- 
ries, applications, and documents by using 
the same kind of window that you use to 
look at documents on the Web. The win- 
dow changes its configuration to accom- 
modate its current purpose, but it is the 
same kind of explorer window nonethe- 
less. Each window has an address bar into 
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which you can enter the address of any of 
the many places that holds things to view; 
the window system seamlessly responds to 
those entries appropriately. 

And that application of a common user 
interface is the advantage an integrated 
Internet Explorer offers that an add-on 
Netscape Navigator does not. A small ben- 
efit to be sure, but a benefit nonetheless, 
and one that passes the test against tying. 
It is merely a technical detail that the IE 
was, under Windows 95, distributed sep- 
arately and installed and integrated with 
Windows Explorer. That detail makes the 
browser no less a part of the OS than, say, 
an installable device driver. 

But Microsoft did not make these ar- 
guments. It simply said, “It’s my OS, and 
I’m Microsoft, and you can’t tell me what 
I can put in it.” 

These are, of course, technical arguments 
and only a small part of the case against 
Microsoft. And if that’s all there was to it, 
I'd continue to oppose breaking up the 
company. Other elements of the case 
caused me to reconsider my position, how- 
ever. One is the Microsoft attitude, epito- 
mized in public statements by Bill Gates 
and Steve Ballmer during the process, in- 
dicating that because they are who they 
are, they can do whatever they want. The 
attitude is painfully apparent, too, in Gates’s 
famous taped deposition and the testimo- 
ny of most management witnesses for the 
defense. Such arrogance begs a comeup- 
pance. Such evasion begs for accountabil- 
ity. Now, that’s not a legal opinion, it’s just 
my reaction to the swaggering and postur- 
ing of a couple of neighborhood bullies. 

Gates said publicly that whoever came 
up with the charges against the company 
and the subsequent recommendations for 
a remedy don’t know anything about the 
software business. He’s only partially right. 
They don’t seem to understand software. 
But I think they understand business and, 
more specifically, business practices, which 
leads to the second element of the case. 

Technology empowers new behavior 
and, consequently, changes culture and 
eventually law. As I opined a few months 
ago, the technology that enables personal 
file sharing will necessarily change the na- 
ture of intellectual property law in this cen- 
tury. Certainly a lot about the new econo- 
my will modify how things are done, too. 
But some things remain constant, and one 
of those things is that companies ought to 
obey certain rules of ethical behavior. Mi- 
crosoft has traditionally ignored that aspect 
of doing business, choosing to use power 
rather than excellence to eliminate compe- 
tition. When a monopoly successfully sti- 
fles competition by means other than pro- 
ducing superior products and providing 
superior service, innovation is likewise sti- 
fled. The incentive to compete is removed; 
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consequently the absence of competition 
suppresses any innovative advances the in- 
dustry might otherwise make. 

Without innovation, the consumer los- 
es in the long run, and my argument that 
a common UI platform for users and pro- 
grammers is best takes second chair to the 
more compelling argument that an absence 
of competition retards the growth of tech- 
nology. This situation has a profound ef- 
fect on programmers because it dimin- 
ishes what we are likely to be willing and 
able to do in the future. 

There are years-old precedents that doc- 
ument the Microsoft attitude about grab- 
bing good ideas. Many features in MS- 
DOS and Windows today were originally 
available as add-on products from other 
companies. File compression (Pkzip), disk 
compression (Stac), media streaming (Real- 
Audio), local networking (Netware), shell 
extenders (4Key), multitasking (Desqview), 
the GUI desktop (the Mac and GEM), and 
so on. Even the first Microsoft C compil- 
er came from a respected compiler ven- 
dor (Lattice) whom Microsoft dumped as 
soon as it had its own compiler almost 
working. Borland invented the integrated 
development environment that QuickC and 
later Visual C++ appropriated. Visicalc be- 
gat Lotus 1-2-3, which begat Excel. Visu- 
al Studio includes many developer fea- 
tures originally found in other products. 
Even MS-DOS is a knock-off of CP/M. 

Microsoft has no qualms about appro- 
priating a good idea and no conscience 
about the effect it has on whoever had the 
idea first. You might argue that the consumer 
wins because all the cool features are even- 
tually included in the bundle, but take a 
wider view from farther back. The next cool 
feature might never see the light of day be- 
cause small developers like us are reluctant 
to expend time, effort, and resources to im- 
plement new ideas knowing our products’ 
markets dry up when Microsoft rips off the 
ideas and gives them away. Microsoft is 
known to be a fairly good implementer, but 
not big on original ideas (unless, of course, 
you think .NET is a good idea). 

You might wonder why Microsoft does 
that. Why clone something someone else 
is selling and then give it away? My guess 
is that since a company generates contin- 
uing software revenue by upgrading prod- 
ucts with new features and selling new 
versions, and when the company is not 
itself an innovator, ideas for new features 
have to come from somewhere. 

The evidence indicates that Microsoft co- 
erced its hardware OEM partners into 
bundling only Microsoft software, specifi- 
cally IE, with the OEMs’ products. Further 
testimony indicates that Microsoft threat- 
ened to cancel the Mac Office 2000 project 
if Apple did not distribute IE exclusively 
with the Mac. With no Office 2000, and not 
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(continued from page 140) 

being a Wintel machine, the Mac would 
have no compelling killer applications. Then, 
Microsoft lobbied Intel to close down an 
Internet software project that targeted non- 
Microsoft solutions. If the OEMs, Apple, and 
Intel capitulate, two Microsoft competitors, 
Sun and Netscape, fall to the same stone 
(slung by Goliath, this time) and Microsoft 
owns the Internet server market. 

These coercions are the gravest of 
Microsoft’s peccadilloes. I don’t think any- 
one should be punished for developing a 
browser, integrating the browser into and 
bundling it with their OS, even if the OS 
constitutes a monopoly. I agree with Gates 
on that point. No one, least of all the gov- 
ernment, should be able to tell him what 
he can and cannot put into his operating 
system. Let the competition build better 
browsers and let consumers choose. Prece- 
dence exists for better OS enhancements 
from third parties. WinZip works better 
than the Plus! compressed folder feature, 
for one example. Building better software 
is not unfair competition. But putting the 
screws to your business partners such that 
they either buy into your agenda or go 
out of business is unfair coercion and 
ought not to go unpunished. IT don’t know 
what they call it in Redmond, but on the 
streets it’s called “selling protection.” 

Motive frequently entered the DOJ’s case 
against Microsoft. It was argued that Mi- 
crosoft wanted people using IE rather than 
Netscape because they wanted to corner 
the Internet software market. IE would work 
better with Microsoft servers, so server op- 
erators would tend to use Microsoft server 
software if everyone was using IE to 
browse. Furthermore, if Microsoft ignores 
the growing potential of Java to develop 
platform-independent Internet applications, 
Microsoft’s grip on the operating-system 
platform market is threatened, Windows be- 
comes just another local file manager/pro- 
gram launcher upon which users run 
portable Internet applications. But by dis- 
placing Java with something that Microsoft 
controls (C#, perhaps?) and controlling the 
browser and server software that accesses 
the Internet, Microsoft controls the API. By 
controlling the API, Microsoft corners the 
Internet applications market. When that hap- 
pens, Microsoft owns the Internet. 

A lot of time was spent during the trial 
establishing motive by introducing e-mail 
and meeting notes wherein Microsoft ex- 
ecutives vow to eradicate the competition. 
This evidence wastes time and energy. In- 
tent is neither relevant nor proven by 
records of what folks say to one another. 
I know they make a big deal about it in 
court, but actions are important here, not 
intent. People say all kinds of things in 
the heat of the moment and when they 
have an expectation of privacy. The only 
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time such evidence should be relevant is 
when the prosecution tries to prove con- 
spiracy charges against individuals or 
when the coercion at issue takes place in 
the e-mail or at the meeting. Internal talk 
about squashing competition is what busi- 
ness people do, typically with brutal, 
chest-thumping, half-time, locker-room, 
pep talk language. Lawyers and politicians 
can get as sanctimonious as they want and 
spout their self-righteous indignation about 
what someone said to someone else, and 
I am not impressed. What convinces me 
that Microsoft stepped out of bounds is 
the evidence of its deeds, not its words. 

I've never understood why Nixon didn’t 
burn the tapes, and I do not understand 
why Gates didn’t erase the e-mail. Perhaps 
the analogy is closer than we think. Per- 
haps the megalomania that drove both men 
and the power they wielded blinded them 
to the fact that their acts have consequences 
and that separate institutions exist with the 
power to monitor the actions of the mighty 
with an eye on the interests of the meek. 

This issue pulls at me from two sides. I 
am best described as a bleeding heart lib- 
eral, a badge I wear proudly. Show me a 
child who goes to bed hungry and my heart 
bleeds. Show me an elderly couple choos- 
ing between food and medicine, and my 
heart bleeds. Show me people targeted for 
police profiling and harassment based on 
how they talk or what they look like, and 
my heart bleeds. Show me a family living 
in a packing crate under a bridge, and my 
heart bleeds. Show me racial imbalance in 
our criminal judicial actions, and my heart 
bleeds. Show me children armed and dan- 
gerous in the streets and schoolyards, and 
my heart bleeds. To those ends, I want the 
government at some level to intervene, help, 
and bring about change. That’s what gov- 
ernment is for— to provide help and pro- 
tection to the people in matters where they 
cannot do it for themselves. 

At the same time, I have conservative 
views. I do not like government intrusion 
where it is neither needed nor wanted, par- 
ticularly when it comes to market-driven 
free enterprise and most particularly when 
it comes to telling us programmers what 
code we can write. My first reactions when 
government lawyers and politicians inter- 
fere with business in areas they do not un- 
derstand are to assume they are misin- 
formed and misguided, assume their actions 
reveal hidden agendas and political motives, 
and generally lobby for the other side. 

I guess the best way to reconcile these 
opposing voices from within myself is like 
this: Show me a mom-and-pop store 
pushed out of business by a huge chain’s 
cold, calculated, predatory, lowball pric- 
ing strategies, and my heart bleeds. 

So what good does a Microsoft breakup 
do for technology? Perhaps without a 
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powerful autonomy controlling the direc- 
tion the Internet takes, it can evolve in a 
healthier, market-driven environment where 
the needs of consumers rather than the am- 
bitions of a couple of billionaire plutocrats 
influence and direct the evolution. 

And we programmers can have more 
and better choices about what to work on 
and what to work with. 

Imagine that Microsoft’s actions went 
unchecked and it was permitted to contin- 
ue with business as usual, which is what 
happens if the final remedy addresses be- 
havioral rather than structural corrections. 
Imagine also that another cool Internet fea- 
ture is wafting about waiting to be imple- 
mented, perhaps your idea. Unless you 
work for Microsoft, you don’t waste time 
doing it because you know Microsoft will 
use its power to either kill the feature (ike 
it tried to kill Java) or clone and appropri- 
ate it. Either way, you lose. Assume you 
work for Microsoft. If good ol’ Bill doesn’t 
think your feature is cool for any reason 
whatsoever, it never sees the light of day; 
and you, the Internet, and its billions of 
users lose. That’s just too much influence 
for one person to have. 

The judge ordered a two-way breakup 
that doesn’t make a lot of sense to me. 
The Attorneys General of several states 
got together and recommended a stupid 
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remedy that would have Microsoft license 
the source code of Windows to other de- 
velopers, essentially creating not an open- 
source environment, but a nightmare of 
software-configuration management, or 
rather, the lack of it. 1 am certain that those 
who recommended the remedies do not 
understand the malady. 

Perhaps Microsoft ought to be broken 
into not two companies as the judge ruled, 
not three as the DOJ suggested, but four 
separate companies. One company would 
have the workstation operating-system mar- 
ket, including whatever programs provide 
access to the Internet from the desktop. The 
second company would have the Internet 
server market, including all server-side sys- 
tems programs. The third company would 
have the applications market, including desk- 
top and server-side applications, database 
managers, and so on. The fourth company 
would have the developer tools market and 
would implement language translators and 
libraries to support the APIs that the other 
three companies specify. These companies 
would operate independently and would 
have to talk not only to each other but to 
all the other companies in the same busi- 
ness. We've come far enough that ANSI/ISO 
could take on the standardization of the 
packets and protocols with which these var- 
ious entities communicate. Such a restruc- 
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turing would necessarily involve longer pe- 
riods of time before some things got im- 
plemented, what with the various lines of 
communication, fiefdoms, NIH syndromes, 
and all. But in the long run, the Internet 
would be market driven and, consequent- 
ly, created in the image of the people who 
use it rather than those who profit by it. 

Which billionaire gets to run which com- 
pany? I don’t give a rat’s ass. If we're lucky, 
theyll be so weary of the whole thing by 
then that they'll retire to their gardens, golf 
courses, and philanthropies, and the rest of 
the world can get back to work. 

What good would such a Microsoft 
breakup do for society? As the son of a 
prison guard, raised in the shadow of a fed- 
eral penal institution, submitted to the oc- 
casional lectures of my correctional officer 
Pop who understood the system, I long ago 
learned that institutionalized punishment 
has five clear objectives: Protect society from 
repeat offenses by the perpetrators; deter 
other potential perpetrators from doing sim- 
ilar deeds; punish convicted perpetrators; 
give victims and society some sense of re- 
venge and closure; and rehabilitate the per- 
petrators. Protection, deterrence, punish- 
ment, revenge, and rehabilitation. Getting 
four out of five wouldn’t be too shabby. 
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Java & NT Authentication 


Elisabeth Strunk 


any Internet sites require user- 
names and passwords for authen- 
tication. The site must store a user- 
name/password pair for each user 
of the site, and users must remember one 
for each secure site they visit. There’s an 
easier solution with intranet sites, how- 
ever. Since many organizations run on a 
Windows NT domain, users must re- 
member an NT username/ password for 
their workstations. Because only internal 
users can see intranet sites, there is a 
ready-made database for authentication 
on the network. 

This is straightforward in languages such 
as C++ or Visual Basic, which are platform 
dependent with built-in interfaces to Win- 
dows API functions. With Java, however, it 
is more difficult. Because of Java’s platform 
independence, it makes no sense to include 
interfaces to Windows functions. Microsoft 
does this with its version of Java, but to take 
advantage of it, the code must run on Micro- 
soft’s VM. Sun has created the Java Au- 
thentication and Authorization Service 
(JAAS), a standard extension to Java 1.3. 
Unfortunately, so far the NT module for the 
JAAS gets the NT permissions of users who 
are logged on to the machine that calls it, 
rather than authenticating specific users. 

Thus, the only flexible way to authenti- 
cate NT users with Java (without getting 
down to sockets) is to write a wrapper for 
the Windows method that performs the au- 
thentication, and call it from the Java pro- 
gram. In this article, I’ll show how to write 
just such a wrapper and call it using Java 
Native Interface (IND. I'll also call that wrap- 
per from another (possibly nonNT) machine 
using Remote Method Invocation (RMI) and 
create services, so the object and the RMI 
registry will run automatically; see Figure 1. 

These techniques may also be of inter- 
est to anyone trying to use a Windows 
Foundation Class (WFC) function, in that 
I'll describe how to write the code to in- 
terface between Java and C++, which can 
be modified easily to make any of those 
functions available to Java programmers. 


The LogonUserA Method 
To log on to an NT domain, Windows NT 
workstations use the LogonUserA method 
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found in Advapi32.dll. I use this method to 
authenticate the intranet site in the same 
way NT performs network authentication. 
The method declaration looks like this: 


BOOL LogonUserA(LPTSTR IpszUsername, 
LPTSTR lpszDomain, LPTSTR lpszPassword, 
DWORD dwLogonType, DWORD dw- 
LogonProvider, PHANDLE phToken); 


The function returns True if the au- 
thentication were successful; otherwise, 
False. All of the type definitions for the 
LogonUserA declaration can be found in 
windows.h. Listing One shows the argu- 
ments to the function. A note about per- 
missions: LogonUserA requires a special 
NT permission, SE_TCB_NAME. This is an 
operating-system privilege, so you must 
give OS privileges to any process that calls 
the method. There are two ways to do 
this: Make your class into a service, or run 
it under a user who has operating-system 
privileges. To give a user OS privileges, 
first make him a local administrator, then 
open musrmegr, go to Policies | User Rights, 
check Show Advanced User Rights, choose 
Act As Part Of The Operating System from 
the pull-down menu, and add the user 
under whom the process will run. This 
method obviously raises some security 
concerns and should be used with care. 


Calling LogonUserA with JNI 

All native code presented here is written 
in C++. However, you can also implement 
the techniques in other languages, in- 
cluding C and Visual Basic. 


1. Write a Java class that calls the native 
method. You can give the method any 
name you want in your class because 
you will call the Windows function from 
a C or C++ wrapper, not from Java di- 
rectly. Declare the method in the class, 
but that’s all— do not define it, just put 
a semicolon at the end of the declara- 
tion. The native keyword will tell the 
compiler that no definition is needed. 
My method declaration looked like this: 


private static native boolean logon- 
UserA(String userName, String userPass); 


2. Compile your .java file, then compile 
the C++ header file from your .class file. 
This gives you the declaration for the 
method that you must implement in the 
wrapper, telling the C++ code to make 
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it available for export to JNI. This dec- 
laration is slightly different from the nor- 
mal declaration to export a function in 
a DLL using only C++, which is why you 
must write a C++ wrapper class. To cre- 
ate the header file, use the javah utility, 
which can be found in /jdk1.x/bin along 
with java.exe. Because JDK 1.0 used a 
different API for native-method interfac- 
ing, you must also specify -jni, like this: 
javah -jni JavaNTLogon. This gives you 
the file javantlogon.h. If your class is in 
a package, specify the package when 
you run javah, and it appears in the 
name of the header file my_pack- 
age_ javantlogon.h. So, it’s easiest if you 
put your class in the right package to 
begin with. (if you do change packages, 
run javah again.) Also, be careful how 
long the package, class, and method 
names are— if they are too long, they 
will be truncated when they are ex- 
ported, and you will get an error. 

Notice that in the method declaration 
in the header file there are two pa- 
rameters that you didn’t declare. These 
are present in any function called 
through JNI, and they are automatical- 
ly passed, so you don’t have to worry 
about them. The first is an environment 
interface pointer, which gives access to 
certain methods for communicating be- 
tween the two languages (changing 
types, throwing exceptions, and so on). 
The second is a jclass pointer, which is 
the C++ equivalent of the this pointer 
for the calling Java object. 


. Write the C++ wrapper for the DLL. This 


class should include the header file out- 
put by javah, jnih, and windows.h. It 
should also implement the method de- 
clared in the header file output by javah. 
This is where you call the Windows 
method you're trying to access. Listing 
Two is the code for this method. 

First, use a typedef to tell the compil- 
er what the declaration of the WFC 
method looks like: 


typedef BOOL (CALLBACK* LPFNDLL 
FUNC1) 

(LPTSTR, LPTSTR, LPTSTR, DWORD, 
DWORD, PHANDLE): 


Then call LoadLibrary on the DLL the 
function is in HINSTANCE hDLL = Load- 
Library('Advapi32.dll");. Check to see 
if hDLL is nonNULL (whether the library 
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was loaded). If it is, get the export ad- 
dress of the function in the DLL: 
LPFNDLLFUNC1 f1 = (LPFNDLLFUNC'L)Get- 
ProcAddress(hDLL, "LogonUserA");. 

Again, ensure this is nonNULL, then use 
the pointer to the procedure address just 
like a function, and call LogonUserA as 
if it were a function in your code: 


ReturnVal = f1dpszUsername, IpszDomain, 
lpszPassword, dwLogonType, 
dwLogonProvider, phToken); 


e Parameters. Several types are repre- 
sented differently in Java and C++. Be- 
cause you passed the username and 
password each as a string, I need to 
convert the Java strings passed in (as 
type jstring, defined in jni.h) into the 
character array representation used in 
C++. jni-h defines a function to do this, 
GetStringUTFChars. This is a method 
called through the environment inter- 
face pointer (previously mentioned). 


const char * Username = env->Get 
StringUTFChars(userName, ()); 

const char * Userpass = env->Get 
StringUTFChars(userPass, 0); 





Figure 1: Execution sequence. 1. The 
LogonUser service calls the Java class 
that handles user authentication re- 
quests. 2. The Java code registers an 
instance of itself with the RMI registry, 
so any machine can try to log a user 
onto a site using the network. 3. A client 
application needs to use the LogonUser 
object. 4. The registry returns the object. 
5. Now, the client can conceptually talk 
directly to the Java code on the server. It 
sends a username and password. 6. 

The Java code, through JNI, calls 
platform-dependent C++ code to 
authenticate the user. 7. The C++ code 
calls the LogonUserA Windows API 
function in Advapi32.dill. 8. 
LogonUserA returns True or False to the 
C++ code. 9. The C++ code returns True 
or False to the Java code. 10. The Java 
code returns True or False to the client. 
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If the username and password you sent 
involve Unicode characters, use GetString- 
Chars() and ReleaseStringChars( ). 

LogonUserA doesn’t actually change the 
arrays, but because the declaration does 
not declare the strings constant, the com- 
piler gives an error unless I copy the con- 
stant array into a nonconstant array. 


//assumes lpszUsername and |pszPassword 
have sufficient space allocated 

strcpyUpszUsername, Username); 

strcpy(pszPassword, Userpass); 


e Exceptions. If your code does not com- 
plete successfully, it can attempt to 
throw the proper exception back to the 
calling Java program. First, the wrapper 
must find the Java exception class to 
throw, then it must throw it: 


jclass exceptionClass = env->FindClass 
("java/lang/Exception"); 
if (exceptionClass == 0) return 0; 
env->ThrowNew(exceptionClass, 
"couldn’t find C function"); 
//the second argument will be the 
//exception’s message string 


Remember that any exceptions the 
method throws must be stated in the na- 
tive method declaration in your Java class. 

e Cleanup. Don’t forget that C++ doesn’t 
do garbage collection. You need to deal- 
locate space created through malloc and 
new, and also LoadLibrary and Get- 
StringUTFChars. Otherwise, you may 
cause a memory leak. Also, check to en- 
sure that if you throw an exception or 
return before the end of the program, 
you do your cleanup first. 


FreeLibrary(hDLL); 

free(IpszUsername); 

free(pszPassword); 

free(phToken); 

env->ReleaseStringUTFChars(userName, 
Username): 

env->ReleaseStringUTFChars(userPass, 
Userpass): 


e UnsatisfiedLinkErrors. When you run 
the Java program that calls the native 
code, UnsatisfiedLinkErrors signals any 
problems the JVM has in finding your 
DLL or the specific procedure. If the er- 
ror says that your DLL was not found, 
make sure it is in your system path, 
along with cw3230mt.dll, which is also 
required. If it says that the procedure 
could not be found, make sure the pro- 
cedure name in the native code match- 
es your package name in both the 
method declaration in the header and 
in the definition (particularly if you have 
changed the package you put your code 
in since you used javah.exe). You may 
want to run javah.exe again, and change 
your #include statement and the method 
name accordingly. Also, check to be 
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sure the native method’s name is less 
than 50 characters long; otherwise, it 
may be truncated on export. 


RMI, LogonUserA, and Other Machines 
RMI is a way to use your machine to call 
a method on a remote machine (what a 
surprise). It has two sides— client and 
server. Your Java class will register an in- 
stance of itself with the server, and the 
client will ask the server to reference that 
class’s methods; see Figure 1. 


1. Write an interface to declare remotely 
accessible methods. This interface 
should extend java.rmi.Remote, and 
must declare any methods you want 
your calling class to be able to see. In 
this case, the only method needed is 
logonUser(String, String). My interface 
is called LogonUser, so when I get an 
object from the RMI registry it is of type 
LogonUser. 

2. Implement the remotely accessible 
methods in your Java class. That is, 
make /JogonUser() call the native 
method /ogonUserA()— the one you 
created earlier. 

3. Create the stub and skeleton for your 
Java class. First run javac on the .java 
source, then run rmic.exe on the .class 
file. (rmic.exe is found in the bin folder 
of the JDK, along with java.exe, javac.exe, 
and the like.) Copy the stub to the client 
program’s machine and set the machine’s 
classpath so that any applications that 
call your class can access the stub. 

4. Ensure that the RMI registry is running. 
If your object is the only one bound to 
the registry, you can create the registry 
in that code via java.rmi.registry.Lo- 
cateRegistry.createRegistry(_). Otherwise, 
you will probably want to create the 
registry via rmiregistry.exe, which is 
found in the same place as rmic.exe. 
The registry is only active as long as 
rmiregistry.exe is running. 

5. Register an instance of the interface with 
the RMI registry. I put the following 
code into a main() method of my class, 
so that I would be able to run my class 
as a service. You do not have to put it 
in the same class, but you must have 
some way to register your class. 

First, create an object of the type of 
your interface. You do this by calling a 
constructor for your class LogonUser 
user = new JavaNTLogont );. 

Then you will bind this instance to the 
RMI registry via java.rmi.Naming.bind( 
or java.rmi.Naming.rebind(): 


java.rmi.Naming.bind 
("http://localhost:1099/LogonUser", user); 


Registering your object with the RMI 
registry gives the registry a handle to the 
instance of the object you created. Thus, 
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(continued from page 140) 
it is valid only while the process that 
bound it is running. 

6. Write the client application for the remote 
object. First, copy the stub for the remote 
object to the client machine, and make 
sure it is in the classpath. Then, create a 
Java method that will call your object. 
Listing Three presents this method. 


Services, the RMI Registry, and Your Class 
Srvany.exe is a utility included in the Win- 
dows NT Resource Kit. It can be regis- 
tered with NT as a service and used to in- 
voke another executable to create a 
service out of that executable. This way, 
you avoid doing the extra work of writ- 
ing your own service. 

Srvany works by calling any specified 
executable, then reporting that it is run- 
ning as a service. Unfortunately, it has no 
way of knowing whether the executable 
was started successfully. Also, if the ser- 
vice is stopped, the executable’s process 
is simply terminated. There is no way to 
cleanup gracefully. If this is a problem, 
you can create your own service to keep 
the registry object bound. There are sev- 
eral tutorials for writing services available 
on the Web, and Microsoft’s Windows API 


documentation is helpful in writing them. 
To use Srvany, you first register SRV- 
ANY.EXE as a service. This is done by us- 
ing the instsrv utility, also in the NT Re- 
source Kit. The general syntax is instsrv 
[servicename] [path] \SRVANY.EXE, where 
servicename is whatever you choose to 
name your service, and path is the path 
where SRVANY.EXE is located. You need 
to do this for each of the services. 


>instsrv rmiregistry c:\NTRESKIT\ 
SRVANY.EXE 

>instsrv LogonUser c:\NTRESKIT\ 
SRVANY.EXE 


Next, open the system registry using 
regedt32.exe. Go to HKEY_LOCAL_MA- 
CHINE | SYSTEM | CurrentControlSet | Ser- 
vices. Here you will find a list of all the 
services currently installed on your ma- 
chine. Go to the first service you just in- 
stalled, rmiregistry, and create a Parame- 
ters key. In this key, create an Application 
value with type REG_SZ where the string 
for the value is /path/\\rmiregistry.exe. Do 
the same for lLogonUser, using 
[path]\java.exe. Then, also in the Lo- 
gonUser Parameters key, add an AppPa- 
rameters value of type REG_SZ where the 
string is the rest of what you would type 


on the command line; that is, classpath 
and qualified class name. 

Next, you may want to set LogonUser to 
depend on rmiregistry so that when Logon- 
User is run, there will be a registry existing 
with which it can instantiate itself. To do 
this, use another NT Resource Kit utility, 
SC.EXE. At a command prompt, simply type 
>sc config LogonUser depend= rmiregistry 

Finally, configure your new services in 
the Control Panel | Services applet the same 
way you would configure any other ser- 
vice. If you want it to run whenever the 
machine is restarted, set Startup to auto- 
matic. You will probably want to leave it 
under the system account, otherwise, en- 
sure that the account you choose has OS 
privileges. 


References 

Further information is available on the Web. 
The NT LogonUserA method documentation 
is at http://msdn.microsoft.com/library/psdk/ 
winbase/accclsrv_9cfm.htm. Sun has tutori- 
als on JNI and RMI. For JNI, see http://java 
.sun.com/docs/books/ tutorial/native1.1/ 
index.html. For RMI, go to http://java.sun 
.com/docs/books/tutorial/rmi/index.html. 
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Listing One 


LPTSTR lpszUsername // string that specifies the user name 

LPTSTR lpszDomain // string that specifies the domain or server 
LPTSTR lpszPassword // string that specifies the password } 
DWORD dwLogonType // specifies type of logon operation 


Values include: 
LOGON32_LOGON_INTERACTIVE 


//as that user. 
LOGON32_LOGON_NETWORK 
DWORD dwLogonProvider 
Values include: 

LOGON32_PROVIDER_DEFAULT 
LOGON32_PROVIDER_WINNT4@ 
PHANDLE phToken 


Listing Two 


// You will want to check to ensure you have deallocated memory through 
// every error path. I have left some of this out for readability. 


#include "logon_JavaNTLogon.h" // Header file output by the RMI registry 


#include <windows.h> 
#include <iostream.h> 
#include <string.h> 


//tells the compiler the function signature to expect. 


typedef BOOL (CALLBACK* LPFNDLLFUNC1) (LPTSTR, LPTSTR, LPTSTR, DWORD, } 


JNIEXPORT jboolean JNICALL Java_logon_JavaNTLogon_logonUserA 
(JNIEnv * env, jclass obj, jstring userName, jstring userPass) 


{ 
//declare all these variables we need 
jboolean successful=0; 


HINSTANCE hDLL; // Handle to DLL 
LPFNDLLFUNC1 f1; // Function pointer 

BOOL ReturnVal; 

LPTSTR lpszUsername = new char[12]; 

LPTSTR lpszDomain = "BFUSA"; 

LPTSTR lpszPassword = new char [3]; 

DWORD dwLogonType = LOGON32_LOGON_INTERACTIVE; 
DWORD dwLogonProvider = LOGON32_PROVIDER_DEFAULT; 
PHANDLE phToken = (PHANDLE) malloc (sizeof (HANDLE) ) ; 


//convert the java strings to UTF-8 strings so 
/ the dll can deal with them 


const char * Username = env->GetStringUTFChars(userName, 9); 
const char * Userpass = env->GetStringUTFChars(userPass, @); 


//returns impersonation token with user's 
// credentials. the token can then be used to run other processes 


//fastest, but does not return credentials 
// specifies the logon provider 


/pointer variable to receive impersonation token 
handle. Remember to allocate space for this--LogonUserA does not it for you. 


return @; 


else 


} 
else 


{ 
return @; 


} 

FreeLibrary (hDLL) ; 
free(lpszUsername) ; 
free(lpszPassword) ; 
free (phToken) ; 


env->ThrowNew(newExcCls, 
"couldn't find the Windows LogonUserA function") ; 


{ // try to call the function 
ReturnVal = f1(lpszUsername, lpszDomain, lpszPassword, dwLogonType, 
dwLogonProvider, phToken) ; 
successful = (jboolean)ReturnVal; 


jclass newExcCls = env->FindClass("java/lang/Exception") ; 
if (newExcCls == 9) 


env->ThrowNew(newExcCls, "couldn't load Advapi32.d1l in native code"); 


env->ReleaseStringUTFChars (userName, Username) ; 
env->ReleaseStringUTFChars(userPass, Userpass) ; 


return successful; 
DWORD, PHANDLE) ; 


Listing Three 


public static boolean logonUser (String userName, String password) 


throws RemoteException 


boolean isValid = false; 


There must be a security manager for certain classes to be 


// served through RMI. 


If you are getting security exceptions, you 


// may have to define your own security manager. 
if (System.getSecurityManager() == null) { 
System.setSecurityManager (new java.rmi.RMISecurityManager()); 


} 
try { 


// Specify the name of the server the remote logon object is on, 
// the port that the remote RMI registry is listening on, and the 
// name of the remote object we want to access. Format is just 


// like a URL. 


String remoteServer 


LogonUser user = 


1099 is the default port for RMI. 
= "//myServer:1699/LogonUser"; 


// Get a handle on the remote logon object by doing an RMI lookup. 
(LogonUser) java.rmi.Naming.lookup(remoteServer) ; 


//copy them into non-constant arrays for the function call 
strepy(lpszUsername, Username) ; 
strcpy(lpszPassword, Userpass) ; 


hDLL = LoadLibrary ("Advapi32.d11") ; 

if (hDLL != NULL) 

{ 

f1 = (LPFNDLLFUNC1)GetProcAddress(hDLL, "LogonUserA") ; 
if (!f1) 


// handle the error 
jelass newExcCls = env->FindClass("java/lang/Exception") ; 
if (newExcCls == Q) { 
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// Call the NT authentication method on the remote object. True means 
// the username and password was successfully authenticated, False 

// means it wasn't. 

isValid = user.validateUser(userName, password) ; 


} catch (Exception e) { 
e.printStackTrace(); 


return isValid; 


DDJ 
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Generating Perfect Hash Functions 


Thomas Gettys 


ashing is a searching technique. There 

are many ways to search a collection 

for an item, some very clever and el- 

egant. For raw speed, however, hash- 
ing is how it is done, period. 

Open any book on algorithms and you 
will find a chapter on searching. In that 
chapter, there will be a discussion on hash- 
ing. But after a short introduction to hash- 
ing and hash functions, you'll find that the 
bulk of the material is about collision reso- 
lution (see, for instance, The Art of Computer 
Programming, Vol.3: Sorting and Searching, 
by D.E. Knuth, Addison-Wesley, 1997). 

There are a couple of very good rea- 
sons for this focus on collision resolution. 
The first is that what makes for a good 
hash function can be very application spe- 
cific. Are the search keys numbers or char- 
acter strings? How is the population of 
keys distributed (compare the names un- 
der “G” in your phone book with the 
words that begin with “G” in your dictio- 
nary)? Considerations such as these will 
affect the design of your hash function. 

About all that can be reasonably said is 
to state the properties that good hash 
functions typically have in common and 
present some examples that have been 
found to work well (see “Hash Func- 
tions,” by Bob Jenkins, DD/, September 
1997). Note in passing that there are in- 
teresting similarities between hash func- 
tions and random-number generators. 

The second reason collision resolution 
techniques receive such attention is that col- 
lisions are a stone cold fact of life with dy- 
namic search lists. Even with the very best 
hash function in hand, collisions will occur 
when inserting items into a hash table. 





Figure 1: Array A. 


Thomas is a senior software developer for 
PSC Inc. He can be contacted at tpgettys 
@teleport.com. 
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In The Art of Computer Programming, 
Vol.3: Sorting and Searching, Knuth men- 
tions the so-called “Birthday Paradox” in 
this context. Suppose you have a hash 
table with 365 entries and a hash function 
that uniformly maps a person’s birthday 
to a table location. The probability that 
there will be one or more collisions when 
23 people are inserted into the table is 
greater than 50 percent! 


Dictionaries 

What if, however, you are blessed with 
the problem of searching a static list? There 
will be no insertions or deletions; the only 
operation is to determine if a given key 
is, or is not, in the list. This is called the 
“dictionary problem,” and occurs in many 
settings. 

Since the list to be searched is static, it 
makes sense to look for a hash function 
that minimizes the number of collisions. A 
hash function for which there are no colli- 
sions is called a “perfect hash function” 
(PHF). A PHF for which the hash table has 
no holes in it (that is, the hash table is only 
as big as the search list) is called a “mini- 
mal perfect hash function” (MPHF). 

There are many techniques for finding 
a PHF (or MPHF) for a given set of search 
keys; which to use is somewhat dependent 
on the characteristics of your particular set 
of keys. An excellent survey of the current 
state-of-the-art is to be found in the mono- 
graph “Perfect Hashing,” by Z.J. Czech, G. 
Havas, and B.S. Majewski (Theoretical Com- 
puter Science, 1997), which also features 
an extensive bibliography. 

In this article, I’ll demonstrate one of the 
techniques for generating a PHF for a set of 
numeric search keys that has come in handy 
on several occasions in my work on bar- 
code decoding. 

There are several barcode symbologies 
in use today, each with its own particular 
character set (for more information on bar- 
codes, see “A C++ Class for Generating Bar 
Codes,” by Douglas Reilly, DD/, July 1993). 
The bars and spaces of a character are nor- 
malized and combined into a single num- 
ber. The relationship between this number 
and the character it is defined to represent 
is arbitrary, and so a table search is nec- 


Dr. Dobb’s Journal, February 2001 





essary to perform the translation. Using a 
PHF, I can determine with a single calcu- 
lation if the number corresponds to a valid 
character, and if so, which one. 


Algorithm Overview 

The algorithm I use to generate a perfect 
hash function is surprisingly easy to ap- 
preciate: 


1. Start with a square array that is ¢ units 
on a side. 

2. Place each key K in the square at lo- 
cation (x,y), where x=K/t, y=K mod t. 

3. Slide each row some amount so that no 
column has more than one entry. 

4. Collapse the array down into a linear 
array; this is our hash table. 

5. The hash function uses ¢ and the dis- 
placements from step 3 to locate K. 


An Example 

To illustrate, I will use as the set of static 
keys the set S={0, 3, 4, 7, 10, 13, 15, 18, 
19, 21, 22, 24, 26, 29, 30, 34}. Since each 
key is to be mapped into a square of size 
t, do you see that ¢ must be at least 6? If 
you try f=5, for example, you would map 
the key K=34 to (x,y)=(6,4), which is out- 
side the 5x5 square. 

You therefore must chose ¢ such that 
t*t>=max(S). In Figure 1, I have inserted 
the keys from set S into square A by com- 
puting the coordinates as x=K/6 and 
y=K=mod6 (x is the row number and y 
the column number). 

This completes the first two steps in the 
algorithm overview. To accomplish step 
3, the rows are shifted to the right until 
there is at most one key in each column, 
recording the amount of each shift in the 
array r. This is known as the “first-fit 
method (FFM). The result of applying the 
FFM to array A is shown in Figure 2. 

The next step is to simply collapse the 
shifted rows down into hash table C (see 
Figure 3). Finally, you realize the hash func- 
tion H(K) as follows: 


x=K/t 

y=K mod t 
index=r[x]+y 
H(K)=Clindex] 
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Figure 2: Result after step 3 of the first-fit method. 
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Figure 3: Collapsing shifted rows down into hash table C. 





Figure 4: Result after step 3 of the first-fit decreasing method. 


(continued from page 151) 

For example, suppose K=15. You have 
that x=2 and y=3, r[2]=5, so the index into 
Cis 5+3=8. Since C[8]=15, we have de- 
termined that 15 is an element of S. 

What if K=17? Well, x=2 and y=5, so the 
index is r[2]+5=5+5=10. Since C[10]=19, 
you know that 17 is not an element of S. 
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A Small Improvement 

The real secret sauce here is in finding a 
good set of row displacements. As it turns 
out, finding a set of displacements that 
minimizes the size of C is known to be a 
hard problem. There is a heuristic that of- 
ten works well, however. It is known as 
the “first-fit decreasing method.” 


Give Us a Piece 
of Your Mind 





and click on the Discussion Forum link: 
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channels, 


The only change from what I just did 
is in step 3; instead of taking the rows in 
turn, you start with the most populated 
of the rows first. Thus, try taking the row 
in the order: 3, 0, 4, 1, 2, 5; see Figure 4. 
In this case, you achieve a minimal per- 
fect hash function. While this cannot be 
expected in general, it has been my ex- 
perience that the first-fit decreasing 
method does result in acceptable table 
compression. I will always try several val- 
ues of tand chose the one that yields the 
smallest hash table. 


Conclusion 

I’ve presented an efficient technique that 
is guaranteed to generate a perfect hash 
function for an arbitrary set of numeric 
search keys. Modifications to this tech- 
nique that make it suitable for letter-ori- 
ented keys can be found in chapter 5 of 
“Perfect Hashing,” by Z.J. Czech, G. Havas, 
and B.S. Majewski (Theoretical Comput- 
er Science, 1997). 

I hope that you are more encouraged 
to consider using hashing, knowing that 
a perfect hash function can be obtained 
quickly and with certainty. 


DDJ 
(Listings begin on page 155.) 
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Listing One 

DEBARGE SOIC CIO OGRA GOGO SIGS IAC IGK 

File: FFDM.C - First Fit Decreasing Method 

Overview: 

This program implements the FFDM algorithm for generating a Perfect Hash 
Function (PHF) for a given set of integer search keys. 

Invocation: 

FFDM KEYFILE t-VALUE 

KEYFILE is the oe Ma eat name of the text file containing the integer 
search keys, one per line. 

t-VALUE is the "magic" number to use for the hash function. The value of 

t must be such that t*t > max(key). 

Notes & Caveats: 

-The algorithm is described in the article "Perfect Hashing" in Theoretical 
Computer Science 182, 1997, by Czech, Havas & Majewski. 

-The data structure names mimic those from the article "Perfect Hashing" to 
ae a. easy to compare the two; the downside is that the names are not well 
chosen! 

ASC GC CIOR IOC CGO ASSIS GI KI Rak / 


#include <stdio.h> // fopen() fclose() 
#include <stdlib.h> // atoi() exit() 
#include <string.h> // strepy() 


// the following section of definitions can be extracted into a header file 
// error codes 


#define Success 4) 
#define t_value_ERROR =1 
#define fopen_ERROR =2 


#define key_value_ERROR -3 
// application-specific constants 


#define t_Max 100 // must be at least sqrt (max_key) 
#define HashTable_Max 1000 // uppe limit for 
#define InvalidKey -1  // a key value that's impossible for your app 
i Rowstruct 
int RowNumber; // the row number in array A[] [] 
int RowItemCnt; // the # of items in this row of A[] [] 


ie 
// global data 


// the arrays A[][], rf] and C[] are those in the article "Perfect Hashing" 
int A[t_Max] [t_Max] ; // Ali] [j]=K (i=K/t, j=K mod t) for each key K 
int r[t_Max]; // r{R]=amount row A[R][] was shifted 

int C[HashTable_Max] ; // the shifted rows of A[][] collapse into C[] 


// Row[{] exists to facilitate sorting the rows of A[][] by their "fullness" 
struct RowStruct Row[t_Max] ; // entry counts for the rows in A 


DOO C GCG AG GO GAS GAS SSSA AS IO II A GG ICR aC K Kk 

Function: void InitArrays (void) 

Overview: Prepares the algorithm data structures for use. 

Parameters: none 

Return Value: none 

Notes & Caveats: 

-A row offset may be @, so the items in r[] are set to a negative value to 
indicate that cha offset for each row is not known yet. 

-Every item in A[][] and C[] is set to a value that is known to be an invalid 
key for the specific application. -1 is often a good choice. 

IT Ete rere reer errr rrerrre rere rere rer er ere eter er errr er ere Pee he er re ay 


void InitArrays (void) 


int row, column; 
for (row = 0; row < t_Max; rowtt) 


r[row] = -1; // valid offsets are non-negative 
Row[row].RowNumber = row; // insert the row numbers and 
Row[row] .RowItemCnt = 9; // indicate that each row is empty 


for (column = @; column < t_Max; columntt) 
Al[row] [column] = InvalidKey; 


} 


for (row = @; row < HashTable_Max; rowtt+) 


C[row] = InvalidKey; 


preter ttt ttt rt ttt tit itt rrr trrerritretrrtrrrrrr irre rr err err erro eres tf 
Function: int ReadKeyData(char *Filename, int t, int *KeyCount) 

Overview: Reads the file of seach keys and maps them into the array A[][]. 
Parameters: 


char *Filename - the name of the file containing the search keys 

ant it - the number of rows in A[][]; max(key) must be < t¥t 
int *KeyCount - pointer to location to place number of keys read 
Return Value: 

fopen_ERROR - the specified file could not be opened (doesn't exist) 
key_value_ERROR - a search key value was too large (depends on t) 
Success - successful completion of responsibilities 


Notes & Caveats: 

-The number of items in each row is also computed and returned in 

Row[row] .RowItemCnt. 

-The number of keys is returned to the caller via a pointer. If an error 
is detected the number of keys reflects how many keys were read before the 


error condition was detected. 
FOO RR KR / 


int ReadKeyData(char *Filename, int t, int *KeyCount) 


FILE *fptr; 

int key; 

int row, column; 

*KeyCount = @; // set # keys=@ before attempting fopen 


fptr = fopen(Filename, "rt"); 
if (fptr == NULL) return(fopen_ERROR) ; 

// £i11 data structures using search key data 
ous ((fscanf(fptr, "%d", &key)) == 1) 


row = 
column = key % t; 

if (row >= t)_return(key_value_ERROR) ; 
A[row] [column] = key; 

Row[row] .RowItemCnt++; 

(*KeyCount) ++; 


return (Success) ; 


pete etter t ttt t iti t ir titre rer rere rte rte tert tre te ee ee ee ee ee 
Function: void SortRows(int t) 
Overview: sort Row[] in descending order of row fullness. 
Parameters: 
int t - the number of rows in A[][]; max(key) must be < t*t 
Return Value: none 


Notes & Caveats: 

-The algorithm needs to know which row of A[][] is most full, 2nd most full, 
etc. This is most easily done by sorting an array of row-item-counts and 
remembering which row the item counts go with. That is what the array 
"struct RowStruct Row[]" does for us. 

-I saw no point in trying to be clever here, so a simple bubble sort is used. 

BOA SAGAS GAGA GAGS A SAGA A AOI CC KK Ik kk kok kk / 


void SortRows(int t) 
int 4, 73 
struct RowStruct tmp; 
for (i = 6: i < t-1; i++) 
foe (] = aF12 j. < ts 37%) 


if (Row[i].RowItemCnt < Row[j] .RowItemCnt) 
{ 


tmp = Row[i]; 
Row[i] = Rowlj]; 
Row[j] = tmp; 


} 
} 
} 


void main(int argc, char *argv[]) 


int t, NumKeys; 
int k, ndx, rc, row, offset, PrintFlag; 
char Filename [64] ; 
// process the command-line arguments 
if (arge < 3) 


{ 
printf("usage: FFDM KEYFILE t-VALUE\n") ; 
printf("where: KEYFILE is the name of your file of numeric keys\n"); 
aes t-VALUE is a number such that t*t > max(key)\n"); 
exit(-1); 


strepy (Filename, argv[1]); 
t = atoi(argv[2]); 


if (t > t_Max) 
{ 
printf("t may not exceed %d\n", t_Max); 
exit (t_value_ERROR) ; 
} 
if (arge > 3) PrintFlag 
else PrintFlag 
// initialize data structures 
InitArrays(); 


// read in the user's key data 
re = ReadKeyData(Filename, t, &NumKeys) ; 
if (re != @ 


1 
QO: 


printf ("ReadKeyData() failed with error %d\n", rc); 
exit(rc); 


// prime the algorithm - sort the rows by their fullness 
SortRows(t) ; 


// do the First-Fit Descending Method algorithm 

// For each non-empty row: 

// 1. shift the row right until none of its items collide with any of 
// the items in previous rows. 

// 2. Record the shift amount in array r[]. 

// 3. Insert this row into the hash table C[]. 


for (ndx = @; Row[ndx].RowItemCnt > @; ndxtt) 
{ 


row = Row[ndx] .RowNumber; // get the next non-empty row 
for (offset = @; offset < HashTable_Max-t-1; offsett++) 
{ 
for (k = @; k < t; k++) // does this offset avoids collisions? 
if ((Cloffset+k] != InvalidKey) && 
(A[row] [k] != InvalidKey)) break; 
} 
if (k == t) 
{ 
r[row] = offset; // record the shift amount for this row 
for (k = @; k < t; kt+) // insert this row into the hash table 


if (A[row] [k] != InvalidKey) 
Cloffsettk] = A[row] [k]; 
} 
break; 
} 
if (offset == HashTable_Max-t-1) 
( 
printf("failed to fit row %d into the hash table\n", row); 


printf("try increasing the hash table size\n"); 
exit(=1)? 
} 
// all done! locate the "right-most" hash table entry 
for (k = @; k < HashTable_Max; k++) 
if (C{k] != InvalidKey) offset = k+1; 
// print the results 
printf("t value : ¥d\n", +t); 
printf("Number of keys : %d\n", NumKeys); 
printf("Hash table size : %d\n", offset); 
printf("Table utilization: %£%\n", 100.0*NumKeys/offset) ; 
if (PrintFlag != 0) 
{ 
printf("\noffset table r[]\n"); 
printf ("row offset\n") ; 
for (k = 0; k < t; k++) 
{ 
printf("%2d %3d\n", k, r[k]); 


printf("\nhash table C[]\n"); 
for (k = 0; k < offset; kt+) 


printf("%d\n", C[k]); 


DDJ 
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DR. 


Tundra 


Dennis E. Shasha 


|B ost people knew him as the gar- 
1 den columnist of the main daily 
'_ newspaper in Anchorage, Alas- 
/@ & ka. “But I have another identity 
ae some of my newspaper readers may 
not approve of: I build gas pipelines,” 
Alan Lionfall said. “Right now, when gas 
is discovered on the Alaskan north slope, 
it must be pumped back into the earth. 
I’m working on getting it into homes and 
power plants. Unlike oil, gas travels bet- 
ter underground at supercooled tem- 
peratures. The fewer the tunnels the bet- 
ter, especially in some of the areas 
precious to wildlife. 

“Now, I have a bit of a mess on my 
hands. You see, before I got involved, 
previous contractors started laying some 
pipe through a caribou migration area. 
The pipes are not connecting the right 
areas. Some of the pipes are probably 
unnecessary. Those contractors are so 
secretive, I don’t even have a map of 
the area. All I have are these ‘leg de- 
scriptions.’ I finally understand what 
they mean: 

“The positions are described by the let- 
ters A through M. We want to send gas 
from B to E— that’s what the contractors 
didn’t get. Each leg is a one way pipe from 
one letter position to another. Here are 
the pipes already laid: 





Dennis, a professor of computer science at 
New York University, is the author of The 
Puzzling Adventures of Dr. Ecco (Dover, 
1998); Codes, Puzzles, and Conspiracy 
(WH. Freeman & Co., 1992), Database Tun- 
ing: A Principled Approach (Prentice Hall, 
1992); (coauthored with Jason Wang and 
Bruce Shapiro) Pattern Discovery in 
Biomolecular Data: Tools, Techniques, and 
Applications Oxford University Press, 
1999); and (coauthored with Cathy 
Lazere) Out of Their Minds: The Lives and 
Discoveries of 15 Great Computer Scien- 
tists (Springer Verlag, 1998). He can be 
contacted at DrEcco@ddj.com. 
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ECCO’S OMNIHEURIST CORNER 


Twelve-year-old Liane stopped strum- 
ming her guitar and said, “So, a leg is an 
edge in a graph. From what I see, there 
is no path from B to E.” 

Lionfall looked at Liane with a smile. 
“So, it’s true she helps you solve the prob- 
lems people ask of you, isn’t it,” he asked 
ECCO, 

“If it weren’t for child labor laws, she 
might put me out of business,” Ecco said 
in a deadpan. 

“In any case,” Lionfall continued, “there 
is no path from B to Eas the young gui- 
tarist says. We have a bunch of routes we 
could potentially use without disturbing 
the environment and that each cost less 
than $1 million. Naturally, we'd like to 
build as few as possible. I list the possi- 
ble ones below. Which ones would you 
suggest we use? 
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Reader: Ecco and Liane were able to get 
away with building only five routes. Can 
you match them or do better? 


Last Month’s Solutions 

As you can see in Figure 1, Dr. Ecco’s so- 
lution was based on a diagonal placement 
of cisterns starting at locations: 


0 90 — requires 10 
0 80 — requires 20 
0 70 — requires 30 
0 60 — requires 40 
0 50 — requires 50 
0 32 — requires 50 
0 15 — requires 50 
0 00 — requires 50 
10 0 — requires 40 
20 0 — requires 30 
30 0 — requires 20 
40 0 — requires 10 


The result was about 847 cells lost 
with a standard deviation of 78. That was 


http://www.ddj.com 


assuming the cisterns lasted for the 
whole season. 

In Ecco’s best arrangement, using the 
same placement, nearly 1000 burn out 
when each cistern holds enough water to 
put out only one fire. 


Reader Notes 

Reader responses to the Mint puzzle (DD/, 
November 2000) gave me great pleasure. 
Not only did I see many solutions that im- 
proved on Ecco’s, but several readers not- 
ed the obvious applications of a rational 
coinage. 

Martin Brown of Belgium observed: “It 
might interest you to know that the Bel- 
gian currency goes 1/2, 1, 5, 20, 50, which 
as you can imagine, leads to pockets full 
of change.” 

Harm T. Voordenhout of the Nether- 
lands thinks this question has continental 
applications: “I had been asking myself 
the same question for about two years. 
The coming of the Euro, which would re- 
place a lot of coins in Europe, got me 
wondering about which system of coinage 
would be the best.” 

The following readers all improved on 
various aspects of Dr. Ecco’s solution: 
Alexander Enzmann, Austin Gilbert, Jon 
Beal, Tomas G. Rokicki, Patrick R. Schon- 
feld, David Stevenson, Steve Kietzman, 
Martin Brown, Harm Voordenhout, Rod- 
ney Meyer, Jimmy Hu, Bruce Moskowitz, 
and Matt Lasley. However, the best over- 
all solutions, in terms of completeness and 
original extensions, came from Ted Alper. 
Here are his solutions, some of which 
were equaled by other readers: 

For three coins: If you insist on intu- 
itive exchanges, the best average you can 
get is 5.3131...you can do this with de- 
nominations (1,5,22) (that is, with 1-cent, 
5-cent, and 22-cent coins), but if you al- 
low nonintuitive denominations, you can 
achieve 5.2020...with denominations 
(1,12,19). And if you allow Exchanges, you 
can have an average exchange number of 
3.9191...with (16,19,21): 


FOUR COINS: 

Best intuitive average is 4.1414. Achieved 
with (1,3,11,37). 

Best nonintuitive average is 3.9293. Achieved 
with (1,5,18,25). 

Best exchange average is 3.1212. Achieved 
with (13,28,32,33). 

FIVE COINS: 

Best intuitive average is 3.4949. Achieved 
with (1,3,7,16,40). 

Best nonintuitive average is 3.3232. Achieved 
with: (1,5,16,23,33). 

Best exchange average is 2.7273. Achieved 
with (13,24,30,32,33). 

SIX COINS: 

Best intuitive average is 3.1616. Achieved 
with (1,2,5,11,25,62). 

Best nonintuitive average is 2.9495. Achieved 
with (1,4,6,21,30,37). 
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Best exchange average is 2.5151. Achieved 
with (1,8,21,38,44,49). 


Ted showed by comparison to the opti- 
mum possible that his solution is best: “For 
6 coins, exchanging for the values 1...99: 
At most, 6 values can be done with only 1 
coin. You can get at most (6*5)/2+6=21 
different possible values with 2 coins (no 
exchange) and another 15 values with ex- 
changes of size 2 (bigger— smaller). So in 
the best case, in which all the remaining 
values take 3 coins, you'll have a total num- 
ber of coins required of 1*6+2%(21+15)+ 
3%*(99—6—21—-15)=6+2%(36)+3*(57)=6+72+ 
171=249, which is achieved with the coin 
sets I already computed.” 

Here were Ted’s solutions when each 
coin was worth a multiple of 5 cents: 
THREE COINS: 

Best intuitive average is 2.5203. Achieved with 
(5,15,40). 

Best nonintuitive average is 2.5263. Achieved 
with (5, 15,40). 

Best exchange average is 2.2105. Achieved 
with (5,30,45). 

FOUR COINS 

Best intuitive average is 2.1579. Achieved with 
(5,10,25,60). 

Best nonintuitive average is 2.1053. Achieved 
with (5,10,30,45). 

Best exchange average is 1.8947. Achieved 
with (10,25,65,70). 





Figure 1: Ecco’s solution. 


FIVE COINS 

Best intuitive Average is 1.9474. Achieved 
with (5,10,15,30,65). 

Best nonintuitive Average is 1.8421. Achieved 
with (5,15,25,30,65). 

Best exchange average is 1.73608. Achieved 
with (5,10,15,45,80). 
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Time Warps 
Gregory V. Wilson 


t’s been several months since I wrote 

my last set of book reviews, and a lot 

has changed in that time. The Software 

Carpentry design competition has fin- 
ished, the company I work for in Toron- 
to has been acquired, and I’ve spent a lot 
of time wondering why the gender gap 
among open-source developers is even 
wider than it is in computing as a whole. 
This hasn't left me as much time for read- 
ing as I would have liked, but the high 
quality of a few of the books that have 
come across my desk recently have more 
than made up for that. 

The most popular books on this month’s 
list will probably be Dave Thomas and Andy 
Hunt’s Programming Ruby, and Barbara 
Liskov and John Guttag’s Program Devel- 
opment in Java. The first of these is a com- 
prehensive introduction to a Perl-like script- 
ing language that has become very popular 
in Japan. As the authors’ web site (http:// 
Wwww.pragmaticprogrammer.com/) explains: 

You can think of Ruby as a mix of Perl and 
Smalltalk, or look at it as Python with full 
object- orientation. It features exception han- 
dling, closures, and iterators. Classes and 
objects can be altered and extended at run 
time. Everything is an object, including the 
basic types (such as numbers). Unrefer- 
enced objects are freed by a mark-and- 
sweep garbage collector. Ruby is portable 
too, and runs on a wide variety of systems. 
And, to cap it all, it has a simple, regular 
syntax. 


Part I begins with a 12-page overview 
of the language’s major features, then de- 
scribes classes, blocks, I/O, and other fun- 
damentals. Part II, called “Ruby in its Set- 
ting,” shows how Ruby can be used for 
CGI scripting, GUI construction, Windows 
automation, and similar tasks. The re- 
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David Thomas and Andrew Hunt 
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Program Development in Java: 
Abstraction, Specification, and 
Object-Oriented Design 
Barbara Liskov and John Guttag 
Addison-Wesley, 2001 

433 pp., $49.95 

ISBN 0201057686 


Ihe Interpretation of 
Object-Oriented 
Programming Languages 
lain Craig 

Springer Verlag, 1999 

254 pp., $79.95 

ISBN 1852531593 


MMIXware: A RISC Computer 
for the Third Millennium 
Donald E. Knuth 

Springer Verlag, 1999 

550 pp., $59.00 

ISBN 3540669388 


mainder of the book consists of a lan- 
guage reference, a guide to the standard 
Ruby libraries, and some appendices. 

Like their other book, The Pragmatic 
Programmer (reviewed here in March 
2000), the writing is lucid, economical, 
and illuminating. There are many exam- 
ples, all of which are easy to follow. While 
I would have liked more detail in some 
of the sections on applications, there’s cer- 
tainly enough here to jump-start new- 
comers. Overall, I give this book full 
marks, and I think that it deserves to be- 
come the standard reference for Ruby pro- 
grammers. 
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SAMS, 2000 
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Program Development in Java, by Bar- 
bara Liskov and John Guttag, is intended 
for use in a second-year course on soft- 
ware development methodology. It assumes 
that readers are already familiar with Java, 
and confirms that Java has become the 
dominant teaching language of the early 
21st century. The first part of the book an- 
alyzes various forms of abstraction, in- 
cluding the use of procedures, abstract 
data types, type hierarchies, polymorphism, 
and iterators. Exceptions are covered ear- 
ly on, as are the differences between 
checked and unchecked exceptions. This 
material is the basis of the second part of 
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the book, in which the authors look at 
specification, testing, requirements analy- 
sis, and other aspects of the design pro- 
cess. The last chapter introduces the no- 
tion of a design pattern, using flyweights, 
singletons, composites, and a few other 
simple patterns as examples. 

Overall, the book is well written, well 
edited, and has a useful index. Its only 
weakness is that there are a few places 
where the authors make general points, 
but do not provide enough specific ex- 
amples for their intended readers to un- 
derstand the generalization. For example, 
while the material on testing against semi- 
formal specifications is good, the testing 
of iterators, data abstractions, and type hi- 
erarchies get only a few paragraphs each. 
Very few of the college sophomores I 
know would be able to apply the gener- 
al principles that Liskov and Guttag make 
to their particular problems without such 
illustrations. 

Iain Craig’s The Interpretation of Object- 
Oriented Programming Languages is also 
a tutorial, but is aimed at a much more 
advanced audience than either of the pre- 
vious two books. Craig’s aim is to intro- 
duce the full breadth of object-oriented 
programming to programmers who only 
know it through its relatively narrow in- 
carnations in C++ and Java. In a compare- 
and-contrast style, Craig describes how 
systems such as Smalltalk and CLOS have 
implemented inheritance, delegation, poly- 
morphism, and typing. He shows readers 
that there are real alternatives— the way 
most of us are used to doing things isn’t 
the only, or even necessarily the best, way. 
While some explanations may be difficult 
to follow if you don’t have some previous 
exposure to these other systems, the ef- 
fort is definitely worthwhile. 

And speaking of “more advanced au- 
diences,” the next book on this month’s 
list is Donald Knuth’s MMIxXware: A RISC 
Computer for the Third Millennium. Knuth 
is one of the brightest minds ever to grace 
our so-called science. As his work on TeX 
and MetaFont shows, he is also an in- 
tensely practical master of the details of 
implementation. 

Knuth’s as-yet-unfinished master work, 
The Art of Computer Programming, is re- 
garded by many as the definitive work on 
the analysis of algorithms. In MVMIXware, 
Knuth describes a simulator for a virtual 
processor called “MMIX,” whose instruc- 
tion set will be used as the basis for the 
complexity analyses in the new edition of 
The Art of Computer Programming. 
MMrxware is written using the literate pro- 
gramming style that Knuth himself in- 
vented. Mathematics and Algol-like pro- 
gram statements are combined, indexed, 
and cross referenced to create a text that 
is sometimes dauntingly dense, but packed 
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with information. As with The Art of Com- 
puter Programming, only a few readers will 
have the stamina to go through this book 
from cover to cover, but it is a superb ref- 
erence for anyone who intends to use the 
software it describes. If only open-source 
software was this well documented, it would 
have taken over the world long ago... 

Of course, any mention of taking over 
the world these days brings XML quickly 
to mind. As Don Box, Aaron Skonnard, 
and John Lam explain in the preface to 
Essential XML, “{it| has replaced Java, De- 
sign Patterns, and Object Technology as 
the industry’s solution to world hunger... 
This is especially ironic given the relatively 
humble origins of XML, which lie square- 
ly in the world of document management 
systems.” 

Of course, XML has outgrown its HTML- 
plus roots, and is on its way to becoming 
a universal canvas that every kind of ap- 
plication can read and write. (This notion 
is explored in more depth in Jon Udell’s 
report for the Software Carpentry project, 
at http://www.software-carpentry.com/ 
Groupware/report.html.) Most of Essen- 
tial XML is, therefore, devoted to naviga- 
tion, transformation, schemas, and other 
data-exchange aspects of XML, rather than 
to its more traditional (in web years) use 
as an extensible successor to HTML. 

Unfortunately, this book races through 
the normal cases too quickly and spends 
too much time on the darker corners of 
various standards and standards-in-waiting. 
While this may be useful for experts who 
want an in-depth look at the special cas- 
es and exceptions that have motivated 
those standards’ more obscure features, 
most readers (including me) will find that 
it can take several minutes to work back- 
wards from some passing remark to an 
understanding of how something specific 
is supposed to be done. The writing is good 
and the humor is dry enough to bear 
rereading, but with nine XML books now 
on my shelves, I am still waiting for one 
that I can recommend whole-heartedly. 

I also have a hard time recommending 
Sean McGrath’s XML Processing with 
Python. The first of its problems is that 
much of its background material (such as 
the two-chapter introduction to Python) 
is too rushed to get newcomers up to 
speed, and too shallow to tell experienced 
programmers anything they don’t already 
know. I know it’s tempting to try to broad- 
en the potential readership of a book by 
including abbreviated introductions of this 
sort, but in my experience, they almost 
always suffer from this double fault. 

The second of this book’s problems is 
its focus on processing XML using Mc- 
Grath’s own token-per-line PYX system, 
rather than the standard SAX and DOM ki- 
braries. PYX is certainly interesting, as it 
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converts XML to a format suitable for in- 
put to grep and other UNIX command- 
line utilities. However, given the book’s 
title and the needs of most programmers, 
I think McGrath would have done better 
by showing readers how to drive SAX and 
DOM, particularly as this knowledge could 
then be applied in other languages as well. 
(In the last six months, for example, I have 
used C and Python versions of SAX, and 
C++ and Python versions of DOM.) While 
McGrath’s book does include one chapter 
each on these libraries, even here, his main 
goal is to show how they can be used to 
generate his token-per-line notation. 

The last and least of this month’s books 
is Christoph Wille’s Presenting C#. C# and 
the .NET platform are an interesting, in- 
novative, and well-designed marriage of 
Java’s platform- independent execution and 
COM’s object technology, and I am look- 
ing forward to working with it. Funnily 
enough, though, the word “Java” doesn’t 
even appear in this book’s index, and it 
glosses over the many compromises made 
in C# in order to allow it to recycle exist- 
ing COM components. I suspect that Pre- 
senting C# is selling rather well, as it is 
the first bound description of Microsoft’s 
new universal glue language to hit the 
market. I also suspect that it would have 
sold just as well if its author and publish- 
er had been honest enough to put “Micro- 
soft Corporation Marketing Department” 
on the front cover. If you think that in- 
dependent publishers are supposed to be 
just that— independent— then I suggest 
you give this particular book a miss. 

Finally, I enjoyed meeting a lot of peo- 
ple at the O’Reilly Open Source confer- 
ence in July, and was particularly pleased 
to discover that Tim Peters is not actual- 
ly a fictional character. However, I was 
disturbed by how few women were pre- 
sent in a technical capacity, even by our 
profession’s low standards. Computing has 
always been an unwelcoming environ- 
ment for women; a quick sample of sev- 
eral dozen SourceForge projects and the 
mail archives of four large open-source 
projects showed that open source has 
somehow managed to make itself even 
less congenial. 

I was therefore grateful when a colleague 
pointed me to work that’s being done to 
close this gap at Carnegie-Mellon Universi- 
ty. Partly as a result of the efforts by Jane 
Margolis, Allan Fisher, and others, the en- 
trance enrollment of women in the under- 
graduate CS program at CMU has risen from 
8 percent in 1995 to 37 percent in 1999. You 
can find out more about their work at http:// 
www.cs.cmu.edu/~gendergap/index.html; 
it's well worth the surf. 


DDJ 
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broad range of features to control all aspects of your 
product's operation. You can offer free one-time trials, turn 
demos into full multi-user network versions, or sell specific 
options by phone, fax or email. Sell usage by time, runs, 
features - the control is all there - easy to implement, and 
even easier to support; and it’s not fooled by reinstallation 
or back-dating. Imagine Internet distribution of your 
software with easy e-commerce features and automated 
authorizations without needing specialized equipment! 





Communication toolkits 
for precise control 
and system efficiency. 


X.25 ¢ HDLC ¢ LAPB « Bisync ¢ Async 
and custom protocols 
WineK ¢ NT 4.0 « Linux * OS/2 « DOS 


Choose Quadron tools to craft and fine-tune a communi- 
cations solution that matches your needs. ARTIC PCI and 
ISA cards have on-board processors that share the CPU's 

load. Check out our web site and call us soon. 


Quadron’ 


www.quadron.com 


telephone 805-966-6424 © fax 805-966-7630 © email info@quadron.com 


©2001 Quadron Corporation 


TOOLKIT VER. 2.0 


Contains everything you need to quickly 


and easily create a printer driver for 
inclusion into your application project. 


download 
the sample! 





 Overworked? 


Slash work 
with WinBatch 


Y/Y easiest macros 
VY tast development 


Y functions for 
everything 





http://see.winbatch.com 


1-800-762-8383 
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ANT Limited has announced its Fresco 
embedded browser for Java. This brows- 
er is written in C and uses the Java Native 
Interface to map platform-specific services 
onto appropriate Java implementations. 
The three-layer architecture consists of the 
ANT Portability Environment, which man- 
ages the interface to the hardware plat- 
form; the core standards-compliant brows- 
er core; and a graphical front end. Fresco 
is platform independent. 

ANT Limited 

20 Cambridge Place 

Cambridge, UK CB2 1NS 

949-363-5662 


http://www.antlimited.com/ 


Esmertec has released its Joed RTOS pack- 
age 1.3, an embedded real-time Java Vir- 
tual Machine. Enhancements to the 1.3 re- 
lease include improved TCP/IP networking 
performance and PPP, DNS, and daytime 
protocol support. The target bytecode 
compiler for dynamic class loading addi- 
tionally supports loading across HTTP and 
TFIP. New boot loaders have been added, 
the Jbed IDE has been enhanced, and 
functionality has been added for Interrupt 
Service Routines (allowing notification 
from an ISR to a waiting real-time task). 
Also new to the Jbed family is Jbed Mi- 
cro Edition CLDC, a small-footprint Java 
Virtual Machine designed for memory con- 
strained devices. 

Esmertec Inc. 

111 N. Market Street, 6th Floor 

San Jose, CA 95113 

408-351-3420 

http://www.esmertec.com/ 


The Curl Web content language is de- 
signed to integrate markup functionality, 
scripting functionality, and a full-featured 
object-oriented programming language, 
built expressly for client-side web appli- 
cations. The beta version of Curl Surge, a 
browser plug-in allowing users of Net- 
scape or Internet Explorer to run appli- 
cations written in the Curl language, is 
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now available for free download. As part 
of the beta program, Curl is also offering 
web developers and programmers the 
chance to test out the Curl language and 
build applications using Curl Surge Lab, 
an integrated developer’s environment. 

Curl Corp. 

400 Technology Square, 8th Floor 

Cambridge, MA 02139-3539 

617-761-1200 

http://www.curl.com/ 


Caesius Software has introduced WebQL, 
software designed to query the Internet 
like a database. Based on web crawling, 
regular expressions, Perl, and HTML, the 
program aims to help business intelligence 
and competitive analysis projects by ex- 
tracting specific information from the In- 
ternet through queries written in WebQL. 
Examples would include questions like 
“How fast is company X growing?” or 
“How many web sites display our logo?” 
WebQL can be used through the provid- 
ed user interface or through the command 
line, where it can collect data for other 
applications. 

Caesius Software 

P.O. Box 94587 

Seattle, WA 98124-9998 

206-280-6836 

http://www.webql.com/ 


Source Dynamics has announced Source 
Insight 3.0, a project-oriented program ed- 
itor and program browser designed for 
large programming projects involving mul- 
tiple languages. Supported languages in- 
clude C/C++, C#, Java, JavaScript, Perl, 
PerlScript, Visual Basic, VBScript, CSS, ASP, 
JSP, and HTML. Source Insight includes a 
graphical view of class hierarchies, call 
trees, reference trees, and relationships 
between context sensitive symbols; rich 
text formatting with automatic and user- 
defined styles; an object-oriented lookup 
engine and language parsers; support for 
nested classes and structures; support for 
C++ class and function templates; symbol 
syllable indexing including substrings; a 
symbolic autocompletion pop-up list; a 
context sensitive Smart Rename feature; a 
keyword-based Search Project feature; a 
context sensitive Lookup References com- 
mand; and support for projects with more 
than a million symbol definitions. Source 
Insight is available for Windows platforms. 

Source Dynamics Inc. 

22525 SE 64th Place, Suite 260 

Issaquah, WA 98027 

425-557-3630 

http://www.sourcedyn.com/ 


Intel has announced Version 5.0 of its C++ 


and Fortran compilers for Windows. The 
new compilers feature enhanced intrinsics 
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and class libraries that support the up- 
coming 32-bit Pentium 4 microprocessor; 
they also include a prerelease version for 
the upcoming 64-bit Itanium processor. 
These compilers can be used on 32-bit 
systems to create 64-bit executables. Each 
compiler supports new features such as 
autovectorization and OpenMP, which is 
designed to allow multiprocessor com- 
puter developers to use high-level direc- 
tives instead of a low-level operating sys- 
tem interface. Intel has also enhanced 
interprocedural optimization, vectoriza- 
tion, and profile-guided optimization in 
its compilers. Both compilers plug into 
the Microsoft Visual Studio development 
environment. The Intel C++ Compiler 5.0 
carries forward its source and object code 
compatibility with Microsoft Visual C++, 
and the Intel Fortran Compiler 5.0 con- 
tinues to be substantially source-code 
compatible with Compaq Visual Fortran. 

Intel Corp. 

2200 Mission College Boulevard 

P.O. Box 58119 

Santa Clara, CA 95052-8119 

503-264-1607 

http://www.intel.com/ 


Curious Networks has developed MAXML 
(short for Multi-Channel Access XML), a 
proprietary language for multichannel ap- 
plications development. MAXML’s goal is 
a write-once, run-anywhere language for 
mobile devices. MAXML is interaction ori- 
ented, modeling the relationships between 
data and user interactions to minimize the 
importance of the presentation layer. The 
platform is compatible with HTML, XML, 
WAP, and VoiceXML. MAXML and Curious 
Networks’ presentation server, Continu- 
um, are currently available in beta form. 

Curious Networks 

833 W. Jackson, Suite 400 

Chicago, IL 60607 

312-491-7500 

http://www.curiousnetworks.com/ 


Basis Technology has released Version 3.0 
of Rosette, a cross-platform library for Uni- 
code. Rosette now supports Unicode 3.0, 
the latest version of the global text pro- 
cessing standard. New features include 
support for additional character mappings 
(including the Euro character); improved 
support for MIME recognition; Unicode 
3.0-compliant sorting/collation for locale- 
specific sort orders; new character prop- 
erties to support correct processing of 
Asian and Middle Eastern text; character 
decomposition for unambiguous text 
searches; and improved speed in transcod- 
ing between different character sets. Ba- 
sis Technology is also offering Euclid, an 
engine that determines the language and 
data encoding of text from unknown 
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sources. When used in conjunction with 
Rosette, Euclid can automatically convert 
text from any source into Unicode. Rosette 
and Euclid are portable across all major 
hardware and operating-system platforms 
and are licensed royalty free in either bi- 
nary or full source code versions. 

Basis Technology 

One Kendall Square 

Cambridge, MA 02139 

617-386-2000 

http://www.basistech.com/ 


Cyrano has released the developer ver- 
sion of OpenSTA, an open-source dis- 
tributed software testing architecture based 
on CORBA. Using OpenSTA, a user can 
generate realistic heavy loads simulating 
the activity of hundreds to thousands of 
virtual users, whatever the development 
languages used. OpenSTA is licensed un- 
der the GPL and maintained on Source- 
Forge. Cyrano is offering consulting, 
training, and support services for the ar- 
chitecture. Cyrano has also upgraded two 
of its proprietary testing tools, Impact and 
Test. Impact 2.2.4 is a SQL transaction cap- 
ture/playback tool that finds performance 
bottlenecks through multiclient simula- 
tions. Test 5.4.2 is a character-based re- 
gression and load testing tool that uses 
scripts and distributed processing to test 
applications across an enterprise-wide IT 
network. 

Cyrano 

26 Parker Street 

Newburyport, MA 01950 

978-462-0737 

http://www.cyrano.com/ 


The most recent versions of ILOG’s opti- 
mization solving engines, CPLEX 7.0 and 
Solver 5.0, feature Concert, a new C++ op- 
timization modeling library and interface. 
Concert is designed to unify constraint 
programming and math programming. 
Concert also allows optimization models 
implemented in C++ to be expressed in a 
format familiar to operations research pro- 
fessionals. With Concert, CPLEX (which is 
C-based) can be used with C++, and op- 
timization models can be specified using 
object-oriented programming. Concert is 
also a part of new releases in the ILOG 
Optimization Suite. All products are avail- 
able for Windows, Solaris, Linux, and 
UNIX platforms. 

ILOG Inc. 

1080 Linda Vista Avenue 

Mountain View, CA 94043 

650-567-8000 

http://www.ilog.com/ 


The K & G Software Group is offering Hi- 


Clock Pro, a program designed to provide 
the exact time to all computers in a net- 
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work based on the time standard of the 
National Institute of Standards and Tech- 
nology (NIST). The program queries NIST 
timeservers using the Internet and adjusts 
the computer’s time to reflect NIST time. 
The host computer can serve as a time- 
server to any size network, and times can 
be checked as often or as seldom as de- 
sired. HiClock Pro includes two alarm 
clocks and customizable skins. A freeware 
version (which displays ads and cannot 
be used as a timeserver) is also available. 

K & G Software Group 

B. Khmelnitskogo, 8/1 

Mikhaylovka 

Volgograd Region, 403300 

Russia 

78446337691 

http://www.kgsoft.com/ 


Visual VCL Help from Cytron is a program 
for Delphi and C++ Builder that lets VCL 
component authors generate help files 
with a visual interface. Visual VCL Help 
employs a single project to simultaneous- 
ly generate both C++Builder and Delphi 
VCL help files. Additionally, it parses the 
relevant source files to automatically gen- 
erate Help topic pages as well as browse 
groups, pop up windows, and hyperlinks. 
Visual VCL Help’s internal dictionary, 
built through parsing of source files and 
manually added elements, holds classes, 
class methods, class properties and events, 
type descriptors, standalone procedures 
and functions, examples, and general help 
topics. 

Cytron Research Ltd. 

414A St. Vincent’s Street, 

Glasgow, Scotland G3 84N 

440-141-248-5186 

http://www.cytronresearch.co.uk/ 


DDC-I’s Very Smooth Migration (VSM) 
package offers a migration path for em- 
bedded system Ada developers switching 
from VAX to Solaris. The VSM package 
bundles either the DDC-I Ada Compiler 
System (DACS) or the Tartan Ada Devel- 
opment Systems (TADS) IDE for Solaris 
with a list of choices that includes on-site 
consulting to assist with rescripting, mem- 
ory and segment set up, tool adoption, re- 
lated Ethernet work and UCC upgrading. 
A final project report with recommenda- 
tions is included. 

DDC-I 

400 N. 5th Street 

Phoenix, AZ 85004 

602-252-6054 

http://www.ddci.com/ 


ActivePython 2.0 from ActiveState, a free 
binary distribution of Python for Linux, 
Solaris, and Windows is available for 
download. ActivePython includes pre- 
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compiled libraries and a number of 
Windows-specific extensions. Features of 
the 2.0 release include a cross-platform 
installer, integrated support for Windows 
APIs, online documentation, precompiled 
extension libraries, PythonCOM support, 
and the PythonWin development envi- 
ronment. ActiveState has also released 
PerlMx, a mail filter engine that operates 
as a plug-in to Sendmail products, of- 
fering mail archiving, volume account- 
ing, custom routing, spam control, key- 
word scanning, and content rewriting of 
messages. PerlMx is implemented at the 
server level and currently runs on Linux, 
Solaris, and HP-UX. 

ActiveState 

580 Granville Street 

Vancouver, BC 

Canada V6C 1W6 

604-484-6400 

http://www.activestate.com/ 


Catalyst Development has announced the 
release of SocketTools 3.5. Included in this 
version of the Winsock-compliant TCP/IP 
custom controls is full support for Uni- 
code; C++ class libraries with complete 
source; a new HTTP control and library; 
an enhanced FTP control and library, in- 
cluding built-in support for four proxy 
servers as well as a user-defined proxy 
type; an SMTP control and library that sup- 
ports advanced features of the ESMTP pro- 
tocol; improved MIME control with faster 
decoding and encoding; and new docu- 
mentation. 

Catalyst Development Corp. 

56925 Yucca Trail PMB 254 

Yucca Valley, CA 92284 

760-228-9653 

http://www.catalyst.com/ 


Oratrix Development has announced the 
availability of its GRiNS for SMIL 2.0 Play- 
er. The Player is intended to assist mem- 
bers of the World Wide Web Consortium 
and others who are evaluating the new 
SMIL 2.0 language profile. SMIL 2.0 in- 
cludes native support for transitions, ani- 
mation, event-based interaction, and ex- 
tended layout facilities. It also includes 
new timing and synchronization primi- 
tives. The GRiNS player provides a pre- 
view engine for Oratrix’s authoring and 
development systems; it is also available 
for RealSystem G2. 

Oratrix Development BV 

Valeriusplein 30 

Amsterdam 

The Netherlands 

31-20-679-5306 


http://www.oratrix.com/ 
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SWAINE’S FLAMES 


Saving Private Lyin’ 


ecently, I was working the late shift behind the bar at my favorite Silicon Valley hangout, Foo Bar, 
PRs Larry, Mo, and Curly Joe walked in. I’m sorry to say that I’ve run into people who question 

whether I actually work at Foo Bar, or whether there is such a place, or even whether these three 
journalists exist. All I can say is that some people have an overly literal view of the truth. I pick up so 
much juicy scuttlebutt on these Foo Bar nights that I long ago decided I'd gladly work there as often 
as possible, whether it was a real place or not. On this particular night, Mo, who currently reports for 
a San Jose-based daily that shall remain nameless, climbed up on a bar stool, ordered her new 
favorite drink, a Glenlivet Glen Ross, and announced, “I see IBM has hired themselves a CPO.” 

“Indeed? A Chief Petty Officer?” Larry asked, smiling as I poured his Chardonnay. 

“Chief Privacy Officer. It’s a way of saying they’re getting serious about privacy. Just spin, I’m sure, 
but I suppose now every dot-com will have to appoint a CPO.” 

“That reminds me of a story,” Joe said, and Mo and Larry moaned in unison while I placed a bottle 
of cream soda in front of him. “It was just after World War II,” he began. “A retired brigadier general 
moved into a palatial wooded estate on the edge of town. At the entrance to his long curving 
driveway, he erected a sign saying, General Arthur, Private Drive. Just down the road was a very 
different neighborhood: prefab houses designed for returning Gls. In front of the first of these modest 
dwellings, there promptly appeared a sign reading, Private Greenbaum, General Drive.” 

Mo squinted thoughtfully, then said, “Reader's Digest, ‘Humor in Uniform,’ circa 1962.” 

Joe said he thought it might have been a little earlier, and added that he thought there was a related 
joke in the IBM story, something about the difficulty of an officer dealing with private matters. Larry 
told him it needed work. Mo said it wouldn’t work. 

Larry lifted his glass to the light and gazed into the backlit wine, a habit of his when he’s 
formulating a thought. “Privacy and publicity,” he announced, “are but two sides of the lately 
devalued coin of repute.” 

“Tl buy that,” Joe said. “My first editor back in Eau Claire used to say there are just two kinds of 
people — those who want to get it in and those who want to keep it out.” 

“They're the same people,” Mo said. “Look at spinmeister Joe Lockhart. He went from keeping 
Clinton’s actions out of the news to getting Oracle’s in.” 

“Agreed,” Larry said, holding up his glass with a different intent. “And there are just three tools — 
thank you, Michael, nothing looks so lonely as an empty wineglass— three tools for controlling the 
flow of information— encryption, legislation, and spin.” 

“Sometimes,” Joe whined, “I think there’s too much whining about privacy.” 

Mo snapped, “Could you keep your thoughts about privacy to yourself?” Joe, offended, slipped off 
to the bathroom. 

“Take Carnivore, the FBI’s technology for scanning e-mail,” Larry went on. “The Carnivore story 
beautifully mixes encryption, legislation, and spin.” 

I mentioned a story I’'d read about a pocket lie detector. “Apparently it can detect lies over the 
phone, so it should be able to filter-out spin on television.” 

Larry didn’t see it that way. “The spinmeisters have that one beaten coming out of the gate. They 
know it’s easy enough to foil lie detectors: One simply never tells the truth. Lie detectors don’t find the 
truth, they only flag lies. If you always lie, they are rendered useless.” He brightened. “I suppose that 
means that perpetual lying is a powerful weapon against invasions of privacy.” 

Mo jumped back in. “Perpetual lying may be a powerful weapon against some invasions of privacy, 
but it doesn’t help when the invasion is physical.” 

“Like random drug testing?” 

“Right, although that at least has been ruled unconstitutional by the U.S. Supreme Court.” 

“When you're in your car. Not in the workplace.” 

“True. But when we are finally all doing all our work in our cars during day-long commutes, the 
car will be the workplace. That’s how we win. Gimme ‘nother, Mike.” 

Joe came back from bathroom just then and wanted to know why there was a computer screen 
displaying ads above the urinal. 

“That's a Viewrinal,” I explained. “The company making them figures that’s one place where they 
can count on a captive audience.” 

“Umph. Well, at least it’s just a downlink. It is one-way, isn’t it?” 


“For now,” I said. “It is for now.” 
f 


Michael Swaine 
editor-at-large 
mswaine@swaine.com 
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SECURITY WILL FULFILL 
YOUR DEVELOPMENT NEEDS. 


Ah, sweet mystery...how does one co-exist in the wired and wireless worlds? RSA 
Security has discovered the one true way: develop with RSA BSAFE WTLS and SSL 
security components, and you easily fulfill your need for security and true interoper- 
ability between your wired and wireless applications. With over half a billion copies of 
RSA BSAFE technology deployed by companies such as IBM, Nokia and Ericsson, your 
applications will exist in a state of complete global harmony — without extensive test- 
ing. So you save big on development time and expense, which does great things for 
your karma. Just what you'd expect from RSA Security, your source for authentication, 


encryption and PKI. 1.800.782.5453 
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Products displaying this symbol 


contain or are compatible with the 
SECURITY 


most trusted e-security technology 





The Most Trusted Name in e-Security” 


Qualify Now for a FREE RSA BSAFE SDK 


RSA, BSAFE, the RSA Secured logo, “The Most Trusted Name in e-Security” and 
the RSA Security logo are registered trademarks or trademarks of RSA Security 
Inc. All other trademarks are the property of their respective owners. ©2000 RSA 


Security Inc. All rights reserved. 
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Featuring 


VisualCafe StructureBuilder TopLink Macromedia Dreamweaver 


We've acquired and licensed some of the best technologies on the planet 
and created an integrated suite that can cut weeks, 
maybe even months, off your application develooment schedule. 


Introducing WebGain Studio.” 


It's a complete, end-to-end, browser-to-database Java development solution. 
And from design to code to deployment, there's nothing more powerful or productive. 


Check It out today at www.webgain.com. 


® GAIN’ 


©2000 WebGain, Inc. WebGain, WebGain Studio, VisualCafé, StructureBuilder and TopLink are registered trademarks of WebGain, Inc. 
Dreamweaver is a registered trademark of Macromedia, Inc. 





